Gene Haur_1120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1120 
Symbol 
ID5733012 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp1282621 
End bp1283781 
Gene Length1161 bp 
Protein Length386 aa 
Translation table11 
GC content50% 
IMG OID641278259 
Productglycosyl transferase group 1 
Protein accessionYP_001543896 
Protein GI159897649 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.233547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATAT TATCTGCCTT GACCTACTAC GCGCCGCACT GGACGGGCTT GACCATGCAT 
GCCCAACGGG TGGCTGAAGG ACTCGCAGCT CGTGGACACC ACGTCACCGT CTTAACAATC
CAGCATGAGC CAACGCTGCC AACTGAAGAA ATCTTGAATG GGGTGCATGT ATTGCGGCTT
AAGCCTGCGG CTCAAATTAG CCGTGGCATG CTCGCACTCA ATTTTCCATT TGTCGCCGCC
AAATTGATTC GTAGCCACGA TGTGGTGCAT GTGCATACGC CCCAACTTGA AGCCTTGCTG
CTTTCAGGGT TATGCCGCGT GCTGAATAAG CCCTTGTTGA TGAGCCATCA TGGCGATTTG
GTCATGCCAA CTGGCTTGAT TAATCGGGCG ATCGAGAAGG TCATGATTGG CCAGATGGTT
TTAGCAGGCA AATTGGCGCG GCGCGTTAGT GCCTATAGTC GCGATTACGC CGCAAACTCC
AGCTTTTTGC AAAAATTCAC CAAAAAATTG ACGTATATTT ACCCACCGGT TGATTTACCA
ACCCCCAATC CAAGCCAAGT AGCCGCTTGG AAAGCCGAAC TTGGGATCAG CGATAAGCCG
ATTGTTGGCT TTGCTGGGCG CTTCGTTGAA GAAAAAGGCT TTGATTTCTT GCTCAAAGCT
ATGCCAATGA TCGCCGAGGT CTTTCCTGAG GTGCGCTTTG TGTTTGCGGG CGAGCACAAA
ATGGTCTATG AAGATTTTTA TTCCACATGT TTGCCGCTGA TCGAGCAAAA CCGTGAACGA
ATTGTGTTCC TTGGTTTGTT GCGTGATTCG CAAAAACTTG CCAATTTTTA TGCAATGTGC
GACTTGTTTA CCTTGCCCAG CCGCACCGAT TGTTTGGCGA TGGTGCAGAT CGAGGCTTTA
CTGGCGGGCA CGCCGTTGGT CACCAGCGAT ATTCCAGGCG CACGGGTTGT CGTGCAGGAA
ACTGGCTTTG GGCGCTTGGT GCAAACTCAA AATCCTCGCG CCTTAGCCGA TGGGATTATT
GAAGTGCTGA AAAACCCTGA AACCTATAAA GTGCAACCCG CCAAAGTTGA ACAAGTCTTT
TCAGTCAAAA CCATTCTCGA TAGCTACGAG CGGACGATGG CCGAAATGTG TGGTCAGCCC
GTTTCTGCCT CGGTTGTATA A
 
Protein sequence
MKILSALTYY APHWTGLTMH AQRVAEGLAA RGHHVTVLTI QHEPTLPTEE ILNGVHVLRL 
KPAAQISRGM LALNFPFVAA KLIRSHDVVH VHTPQLEALL LSGLCRVLNK PLLMSHHGDL
VMPTGLINRA IEKVMIGQMV LAGKLARRVS AYSRDYAANS SFLQKFTKKL TYIYPPVDLP
TPNPSQVAAW KAELGISDKP IVGFAGRFVE EKGFDFLLKA MPMIAEVFPE VRFVFAGEHK
MVYEDFYSTC LPLIEQNRER IVFLGLLRDS QKLANFYAMC DLFTLPSRTD CLAMVQIEAL
LAGTPLVTSD IPGARVVVQE TGFGRLVQTQ NPRALADGII EVLKNPETYK VQPAKVEQVF
SVKTILDSYE RTMAEMCGQP VSASVV