Gene Haur_1905 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_1905 
Symbol 
ID5733794 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp2299446 
End bp2300759 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content52% 
IMG OID641279049 
Producthypothetical protein 
Protein accessionYP_001544676 
Protein GI159898429 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0772428 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGAG TAGTGTTGTG GATGCTAGCA TGTTTGTTTG TGCTGGGGTT TCAGGCTACG 
GCTCAGGCGG CTGAGCCAGC CAAAATCTCG CGTAACCAAG CAACAAGCAG CTTTGGCGAA
AATGTGGTTT TTGAGCTTGA AGCTGAATCA AGTTCGCCAA TTCGCGAAGT GACGTTTTTA
TATGCCTTGG GTGTGCAGCC AGGCGATGTG CCAGCTTACA CCGAGGCCGA GGCCAAATGG
CAACCTGGTA GCTCGATTGA GGCCAGTTTT ACCCGCGATA CCAGCATTGA ATTTTTGCCA
GTTGGCGTGA CGGTGCGCTA TAAATGGCAG CTGGTTGCCG AAGATGGCAC GATTACTGAA
ACACCTGAGC AATCAGTCCA ATATCAAGAT ACCCGTTTCA ATTGGCAAGA AAAAAGCTCA
CGTGGGATCA CTGTGCGCTG GTATGATGGC GATGAGCAAT GGGGACAAGA TTTGCTCGAT
AGCGCACTTG GTGGGCTTGA TCGGCTTGAG CAGCGGATTG GCGGTTCGGT CGAAGATCCC
ATGACGATCT CGATTTATAG CAATACCCGC GATATGCGCG GCGCTTTGCC ACCCAACTCA
GCCGATTGGA TTGGCGGTCA AGCACGGCCT GACCTTGGCT TGATTATTGG GTCGATTGAT
GCTGGCGATG ACGCTGAATT AGGTCGTTTA GTGCCGCATG AATTAAGCCA TTTGGTGCTG
CATCAAGCAA CCAACAATAA TTATGGTGGT ATGCCAGTTT GGTTCGATGA AGGTTTGGCG
GTTGCCAACC AAGATTCGCC CGACGCTGGC TTTAAGCAAA TGGTTGAGCG GGCTGCCGAA
AATGGCGAGT TGATTCCGTT ACGTGCTTTG GCCTCGAATT TTCCTTCCGA CCCTGAAAAA
GCCCTGCTTT CGTATGCCCA AAGCGAAAGT GTGGTGCGTT ACATCGAATC AACTTATGGC
ATCGAGGCGA TTACCAAACT CGTCGCTCAA TTTAAAAGTG GCGTAACCGA TGATGTGGCG
GTGCAAACTG TCTTGAATCG TAGCCTTGAT ACCTTGGATA GCGAATGGCG CAGCACCTTG
CCTGAAGCGC AAGGCTCTGG CCCAGCCCAA ATCTTGCCCG ACGATACCGC TCCAGCTGAT
CGATTTAGCG AACAACCACG ATCCTCAGCT CCTAGCAACC CAAGCGCACC CAATAGTCCA
GCGGCAACCC CATCAGTGCC CTTGTGGATT TGGCTAGCAG GGATTGGGGG CTTGCTCTTG
ATCGTTTTTG GTACGATTTG GATTATTCGC AGCAGTCGCC AACCACGCTA CTAA
 
Protein sequence
MRRVVLWMLA CLFVLGFQAT AQAAEPAKIS RNQATSSFGE NVVFELEAES SSPIREVTFL 
YALGVQPGDV PAYTEAEAKW QPGSSIEASF TRDTSIEFLP VGVTVRYKWQ LVAEDGTITE
TPEQSVQYQD TRFNWQEKSS RGITVRWYDG DEQWGQDLLD SALGGLDRLE QRIGGSVEDP
MTISIYSNTR DMRGALPPNS ADWIGGQARP DLGLIIGSID AGDDAELGRL VPHELSHLVL
HQATNNNYGG MPVWFDEGLA VANQDSPDAG FKQMVERAAE NGELIPLRAL ASNFPSDPEK
ALLSYAQSES VVRYIESTYG IEAITKLVAQ FKSGVTDDVA VQTVLNRSLD TLDSEWRSTL
PEAQGSGPAQ ILPDDTAPAD RFSEQPRSSA PSNPSAPNSP AATPSVPLWI WLAGIGGLLL
IVFGTIWIIR SSRQPRY