Gene Haur_2976 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2976 
Symbol 
ID5734848 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3754215 
End bp3757499 
Gene Length3285 bp 
Protein Length1094 aa 
Translation table11 
GC content52% 
IMG OID641280120 
Productabortive infection protein 
Protein accessionYP_001545742 
Protein GI159899495 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.292884 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCATAAGC CCGATACCAA ACGACTGATT CCCCCAATTT TTGTTCTTTT AGCAATATTT 
GGCATTTTCT TAGCGATTGT GTTTGCCGCA GAATCAACGC CCATGGCCGA GATTCGCTTT
AATCTGGATC GTGAGGCGGC TTTACAACGC TCAGTCGAAG CATTACAGGC AGCGGGTGGC
GATCCCAGCC AGTTTACCCA AACAATCACT TTTGGCTCGA ACAACGATGC TCAATCGTAT
CTGATTCGCG AACGCAGCCG CGCACTGCTC AACCAACGGG TTGACGAAGA TCTTAATCTG
GCCAGTTGGA ATGTGCGGTT TTGGCGTGAG CTTGATCCAG AGCAATGGTC GATTAGTCTT
TCACCAGCCA CAGGCCGCAT CTTGGCAATT AATCATATGC GCCCTGATGA AGCGGCTGGA
GCAACGTTGA GCCAAACCGA GGCCTTGCTT CTGGCCCAAG CTCAATTGCC AATTCCGCTT
GATCAATTAA CCTTGCTCGA TCAATTTACC AATCAACAGC CCAATCGTAC CGACCATACC
TTTATTTGGC AGCGCCGTGA TATCAGCGAT GCTGAGGCTC AATATCGCTA CAGTGTTACA
ATTGCTGGCG ATCAATTGGG CCAGCGTGGC GAGTATTATT GGCTGCCCCA ATCGTGGTAT
TTGGATAAAG ATTGGCAACT GCGCCGTGGT GGTATTCTCA ACCACACTGG CTGGACGCTG
ACCTATGCCC TGACTGCCTT GATTGGCGTG GCTTGGTTTA TTCAGGCACG GCGGGGGCGA
TTACGCTGGC GTTGGGCCTT ACGCCTGTTT GCAGCGGTCG CGGCGGTCGG CGTGTTGGTC
ATGCTCAACA GCATTCCGCT CGATTTGGCT CATTATGATA TTAACCAAAG CCTGCCGGTT
TACTGGGGCA ATGTACTTGG TGGTTATGTT GGTCAACTGG TCACAATCGC CACCACAATT
ATTTTGGCGG GCATGGCTGG CGAGGCGCTA GTTTGGGAAG AAACCGAGGG CACTGTTTCG
TTGAGCGAAA CCTTGACCCA CCGTGGGTTA GTGAGTCGCC CAGTCGTGCA GGCTTTATGG
GTTGGCGGCT TGGTTGGAAT TTTCCAGCTA GGCTTTGTTT CGGCCTTTTA TGCCTTGGGT
AGCCGCTATT TTGGGGTTTG GTCGCCCGTT ACGCCCTTGT ATGATGATAC GATTGCTACG
CCGTTTCCCG CCTTGTATGG CATGGCGCTG GGCTTGTTGC CCGCGATCGG CGAGGAATTG
ATCTTTCGCT TGGGCGGGAT TACGGTGTTG ACCCGTTATT TTGGTCGGCC CAAGCTGGCA
ATCATTGTGA CAGCCGTGGT TTGGGCCGCA CTGCATGCCA CCTATCCCCA ACATCCGTTT
TATATTCGCG TGGTTGAATT AAGCATTATC GGGATTTTGT TTGGCTTTTT GAGCGTGCGC
TACGGCGTGT TGGCCTCAAT CGCGGCCCAC TATACCTACA ATGCCTCGCT GTATGTGCCA
CTTTTTTGGA AAACCAATAA CTTCTATTTG CTTAGCGGGA TTGCCGCCGC CGCCCTGGTG
TTGTGGTTGC TGGTTCCGGC CATCATTCGC CAATTGCGGG GAATTCCGCT TGAGAGCGAT
AATACAATTC GCGCCGCGCT GCCACCCCAA GTGCCCGAGC CAGTTGTGCC ACAATTAAGC
TGGCAATGGC GAAGCGATTG GAAACTGTTT GCAGGTCTAT CGGGCTTGGC GGTCATTATG
TTGCTGGTGA TTGGGCTGAA TCGTGCGCCT GCCTTGACCC GCAACGGCGT GCGCCAAGAT
ATGGTTACCC GCACCGAGGC CCTCGCCCAA GAGCGCAAAA TCGATCTAAC AGGCCTGAAC
CCCAGTGTTA CGGTCGTGGC CGATTGGGTT GACCTTGATT TAGCCTATAT CTACGATCAA
CTAGACCCTG AGCAAACCGC TGCTGCGATT GAAAAAGGCA CAGTTCGCGC CTGGAGCGTG
CGCTGGTCGA ATTGGGATCG ACCTGAGTAT AGCTGGTTGG TATATCTCGA TCCAGCTGGG
CGTTTGCTAT CGTATCGCTT AAGTTTGCCC GAAAATGCCA AGGCGATTTC AACCACGCTC
GAACAAGCGC AAACGATCGC AACCACCCAT GCCAGCCAAT TTATCGCGCT TGATCAGTAT
GAGTTGAGTA ATACTAGTAC CAGCCAAAAG CCCAATCGCA ATGATTATAG CTTTGTTTGG
CAGACTAAAA TGCCATGGAT TGGCGAGGCC TATCGACGCT TCGAGGTAAC CGTAGCTGGC
GATCAAGTAA TTGTCAGCTC GCCCTCGATG TATACCCCGC CTGAATATCG GCGTGAGCGC
GACCAAACTA CCCTGAGCGA AAGCATTCTG AGCAATTTGC GCAGCGCCTT GCGTGGCATT
CCGGCCACAA TCTTGCCAAT TTTGGGCTTG ATTGGCATTT TTCGGCGGCG TACTGAGCTT
TGGCCGTGGG TTTGGCTAGG AATTATTGCC GGGATTGGCT ACTTAATTCA AGGTGCTGGG
CGTTGGACAT TTGCCCAAGT TAGCGAACTA CCACGCTTTA TTTTAGCAAT CAGCCAAACC
TTGGCGAATG CCGTCTTGAA TGGTGGCGAG CTAGCCTTAT TGGGAGCAGG CGCGGCGACC
GCTTGGAACC TGACCAAAAA CGAGCAACAA TTGCCGCTTG AGGCTTTTAT ACGGGCAATT
CCGCATCGAA TTCAAGATTT TGTGGCTGAT CGAGCGCAAG TTTTGCGGCG CGAAAGCATT
GTGTTAGGGA TTTTGATCGT GCCATGGATC TTGCTGATTC GCAGCAGCAT TGGGTTTACA
ACAGCCAAAG CTGGGTTTTG GGCCAGTGTT CAACCATTGA ACGCACAATC AGCAATGCTT
GATATACTAT TACAAGCAAC ATTTGATGCA ATTACCACAA GTTTGCTCTT GATCGGTAGC
CTGAGCATGC TAACATGGGT AGTACGTGGG AAGCAGCAGA TCGCCTTGGC AATTACCTGT
CTTGGAATGG CATTGGTCCT CTTGCCGTTA CGCGAACCAG CTCAATGGCT CGTTTTAGGG
CTAGCTGTGC TGTTGAGCTT TTGGCTTGGC CGCATGCTGC GCTGGAATGG CTTGGCTTTA
ACTGTTGCTT TATGGTTGGC AAACATTGTA CCAGCCGCCC TAACCTTACT TGCAACAACC
CCACTTGCAT TGCAACTGAA CGGCGCAGCC TTGATTGTGT TGCTCTGTGG CTTTTGCGGG
TGGTATCTCG GTGGATGGTG GCAAACACAA AATGCAGAAA ATTAA
 
Protein sequence
MHKPDTKRLI PPIFVLLAIF GIFLAIVFAA ESTPMAEIRF NLDREAALQR SVEALQAAGG 
DPSQFTQTIT FGSNNDAQSY LIRERSRALL NQRVDEDLNL ASWNVRFWRE LDPEQWSISL
SPATGRILAI NHMRPDEAAG ATLSQTEALL LAQAQLPIPL DQLTLLDQFT NQQPNRTDHT
FIWQRRDISD AEAQYRYSVT IAGDQLGQRG EYYWLPQSWY LDKDWQLRRG GILNHTGWTL
TYALTALIGV AWFIQARRGR LRWRWALRLF AAVAAVGVLV MLNSIPLDLA HYDINQSLPV
YWGNVLGGYV GQLVTIATTI ILAGMAGEAL VWEETEGTVS LSETLTHRGL VSRPVVQALW
VGGLVGIFQL GFVSAFYALG SRYFGVWSPV TPLYDDTIAT PFPALYGMAL GLLPAIGEEL
IFRLGGITVL TRYFGRPKLA IIVTAVVWAA LHATYPQHPF YIRVVELSII GILFGFLSVR
YGVLASIAAH YTYNASLYVP LFWKTNNFYL LSGIAAAALV LWLLVPAIIR QLRGIPLESD
NTIRAALPPQ VPEPVVPQLS WQWRSDWKLF AGLSGLAVIM LLVIGLNRAP ALTRNGVRQD
MVTRTEALAQ ERKIDLTGLN PSVTVVADWV DLDLAYIYDQ LDPEQTAAAI EKGTVRAWSV
RWSNWDRPEY SWLVYLDPAG RLLSYRLSLP ENAKAISTTL EQAQTIATTH ASQFIALDQY
ELSNTSTSQK PNRNDYSFVW QTKMPWIGEA YRRFEVTVAG DQVIVSSPSM YTPPEYRRER
DQTTLSESIL SNLRSALRGI PATILPILGL IGIFRRRTEL WPWVWLGIIA GIGYLIQGAG
RWTFAQVSEL PRFILAISQT LANAVLNGGE LALLGAGAAT AWNLTKNEQQ LPLEAFIRAI
PHRIQDFVAD RAQVLRRESI VLGILIVPWI LLIRSSIGFT TAKAGFWASV QPLNAQSAML
DILLQATFDA ITTSLLLIGS LSMLTWVVRG KQQIALAITC LGMALVLLPL REPAQWLVLG
LAVLLSFWLG RMLRWNGLAL TVALWLANIV PAALTLLATT PLALQLNGAA LIVLLCGFCG
WYLGGWWQTQ NAEN