Gene Haur_2981 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2981 
Symbol 
ID5734853 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3762823 
End bp3765396 
Gene Length2574 bp 
Protein Length857 aa 
Translation table11 
GC content53% 
IMG OID641280125 
ProductIg family protein 
Protein accessionYP_001545747 
Protein GI159899500 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGTTTTC GGCGAACAGC ATACGGGCTA GGATTAAGTT TATTGTTAGC GACCCTCCCC 
CACGCAAGCT TGGCAACCCC AGCGCTCACC ACAACTGGCG TTCCAACCGA CCCCACTCCG
GCAATCAAAC TCACGCGAAT TGGCCGTTAT AACCCAGGCC CATTTCGTAG CGCTGATCCA
CGGGCAGCTG AAATTGTCGA TTTTGATCCG CAAAGCCAGC GCATGGTCTT GATCAATGGC
TTTAACAGCG CCTTGGATAT TGTTGATCTG AGCAATCCGG CCAACCCGCA GTTGCTTACA
ACGATCGCCA TTACGCCCAC CAGCAGCAAT GTGCCCAACA GCGTGGCCGT GCACAATGGC
TTAGTCGCGG TGGCCGCCAA TGCTGCCGTC AAAACCGATC CTGGGCGGGT GGTGTTGTTC
AATCGCGATG GCGTGTTTTT GAATGAAATC ACAGTTGGAG CAGTGCCCGA TATGCTGACC
TTCACGCCCG ATGGCCGCCG GATCGTGGTG GCAATTGAGG GCGAACCCAA CAGCTACAAC
CAAGTTGATT CGGTTGATCC TGAGGGGGCG GTGGCGATTA TCGATTTGCC GCAAAATTTT
GCCAACATTA CAACCACCAG CGTGCTTTCA TCAAGCTTGG TTGGCTTTAC TGATTTTAAT
CTGGGTGGCA GTCGCCATGC TGAGCTTGAC CCGCAAATTC GAATTTTCGG GCCAAACGCC
AGCGTCGCCC AAGATTTAGA GCCGGAATAT TTAACAATCT CTGCCGATTC GAGCAAAGCC
TATGTGACGC TGCAAGAAAA TAATGGCTTG GCCTTGATCG ATCTGAATGC AGGGCGGGTG
CAATGGCTCA AAGCTTTGGG CTATAAAAAT CACAATCTCG CGGGCTATGG GCTTGATCCC
AGCGATAGCG ATGGCATGAA TGCAATTGCG CCATGGCCTG TGTTGGGTAT GTATCAGCCA
GATACGATTA ATAGCTATGC TGCCAATAAC CAAACCTATT TGGTAACTGC CAATGAAGGT
GATGCCCGCG ACTACACCGG ATTTACCGAA GAAGTGCGGA TCAAAAATGT GATGCTTGAT
TCGAGCGTGT TTACCAACGC TGCCAGCCTG CAACAAGATG CCCAACTTGG GCGCTTGAAT
ATCACCAATA CTAAGGGCAA CTTTGGCGGG CAGCACCATG CACTCTATTC ATTTGGCGCA
CGCTCATTCT CAATTTGGGA TGGTACGACA GGTCAGTTGG TATTTGATAG TGGCGATGAT
TTGGAAACCC GGACTGCCGC TACGTTTCCA AATAATTTTA ATGCCAATAA CACCGCCCAC
AGCCGCGATA ACCGTAGCGA CGATAAAGGC CCAGAGCCAG AAGCTTTGGC GGTAGCGACG
ATTGATGGCC GCAGCTATGC CTTTGTCGGC TTGGAGCGGA TGGGCGGAAT TATGGCCTAC
GACGTGAGCA ACCCGCACGC GCCCCAATTT CTCGAATATT TCGCTGCGCG TAGCTTCCCC
AGCAGCTATG TTACTGGCAC GCCCGATGAT CTTGGGCCTG AAGGCATGCA TGTGATCGCC
GCCGAAGATA GCCCAACTGG CAAGCCCTTG TTGTTGGTTG CTAACGAAGT GAGCGGCTCG
GTTTCGATCT ATCAAATTAG CGCCCAAACT CCTCGCATGC ACTTGAACCT GAGCGATGGC
TTAACCAGCG TGCAACCCAA CACCTCGGTT ATTGCCTCGC TGAGCTTGAA TAATCAACAA
ACTGAGCCAA GCGCTCGCCC GGCAACTGAA GTCCAAGTGC AGTATCTTGT GCCAAGCCAA
TTAAGCTACA ACGGTTGTAC AATTGCCAGC CCCTTGGCGG GCACATGTAG CCAGCAAAAT
GGCCTAGTAA CCTTCAATCT GACCACACCA TTTGCCTCGG CTAGCCAAGG CTTGTTGCAG
GTTGCCACCA CGGTCAAGCC CAATGCCACA GGCACAATTG AGCATCAAGC CAGCCTCAGC
TATCGCGATG CTGGCGAATT GCAAACCACG GTTCAAGTCA GCGATACGAC CACAATTGGC
GTTGCACCGT TGATTACCAG TGGCTTGCCC ACGGCGGCGA GCTATGGCGC GATCTATAGC
CACACCCTGA CGGCGAGCGG CATGCCAACC CCAACTCTCA ATCTTGTTGG CAACTTGCCA
GCAGGCTTGA GCTTCGATAG CCAAACTGGA ATTTTGGCGG GTACGCCGAC CACCAGCGGT
AGTTTCCCAA ATTTGATCTT CCAAGTGAGC AACGGAATTG GTACAATGGT AACGCAAAGC
TTTACGCTGA CCGTTGCCAA AGCGCCATTG CAGGTCGTTG CTGATAACCA ACGTCGTTTA
TTCGGCCAAC CCAACCCGCC CTTGAGCTAT CAAGTAACTG GCTTGCGCTT GCAAGATACG
GCTGCAAGTG CATTAACTGG CACATTAACC ACCACCGCAA CCCTCACCAG CCCGCTTGGT
GAGTATCCAA TTAGCCAAGG TAGTTTGCAG GCTCAACACT ACCAAATGAG CTTTAGCGCT
GGCATACTCA CCATCGAAGC CAACGCGGTT TACCTACCCT TGATTGGGAA ATAA
 
Protein sequence
MRFRRTAYGL GLSLLLATLP HASLATPALT TTGVPTDPTP AIKLTRIGRY NPGPFRSADP 
RAAEIVDFDP QSQRMVLING FNSALDIVDL SNPANPQLLT TIAITPTSSN VPNSVAVHNG
LVAVAANAAV KTDPGRVVLF NRDGVFLNEI TVGAVPDMLT FTPDGRRIVV AIEGEPNSYN
QVDSVDPEGA VAIIDLPQNF ANITTTSVLS SSLVGFTDFN LGGSRHAELD PQIRIFGPNA
SVAQDLEPEY LTISADSSKA YVTLQENNGL ALIDLNAGRV QWLKALGYKN HNLAGYGLDP
SDSDGMNAIA PWPVLGMYQP DTINSYAANN QTYLVTANEG DARDYTGFTE EVRIKNVMLD
SSVFTNAASL QQDAQLGRLN ITNTKGNFGG QHHALYSFGA RSFSIWDGTT GQLVFDSGDD
LETRTAATFP NNFNANNTAH SRDNRSDDKG PEPEALAVAT IDGRSYAFVG LERMGGIMAY
DVSNPHAPQF LEYFAARSFP SSYVTGTPDD LGPEGMHVIA AEDSPTGKPL LLVANEVSGS
VSIYQISAQT PRMHLNLSDG LTSVQPNTSV IASLSLNNQQ TEPSARPATE VQVQYLVPSQ
LSYNGCTIAS PLAGTCSQQN GLVTFNLTTP FASASQGLLQ VATTVKPNAT GTIEHQASLS
YRDAGELQTT VQVSDTTTIG VAPLITSGLP TAASYGAIYS HTLTASGMPT PTLNLVGNLP
AGLSFDSQTG ILAGTPTTSG SFPNLIFQVS NGIGTMVTQS FTLTVAKAPL QVVADNQRRL
FGQPNPPLSY QVTGLRLQDT AASALTGTLT TTATLTSPLG EYPISQGSLQ AQHYQMSFSA
GILTIEANAV YLPLIGK