Gene Haur_2381 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_2381 
Symbol 
ID5734262 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp3033183 
End bp3034628 
Gene Length1446 bp 
Protein Length481 aa 
Translation table11 
GC content49% 
IMG OID641279522 
Producthypothetical protein 
Protein accessionYP_001545149 
Protein GI159898902 
COG category[S] Function unknown 
COG ID[COG1262] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00647578 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCTGATA TCACCAATCG TCAGCAGTTT TGTCGGGCAG TCGGGGATGG GTTAAAGTTA 
GTCCAAAGTC AACGAAGTGG CTCCATGCCA AGTACAGAAG CGTATATAGC CGATTGCTTG
AGCATTGGGG TTGATACACT CCGTACATTA CGCTATGCTT CCCGTCAACG CGTGATGGTC
GAAGATAAAA CTTTAGCCGC ATTAATTTGG ATTGTCCTGG CCGAAGGTCG GGCAACTCAA
CCATGGCTGA TCACAATGCT CAGTGCGACT AGTATTATTC CCCCCGAACC ACTGACAACT
ACTTGGCTTG AGAGCTATCT GCAAAGTGGT TTCAAACAAC AACTCGATCA AGCATTGCTG
AGCCAGGTTG TCCAAAGCAT TCTTCCTAAC CAAGCTCCAA GTGTCCTCAA TCTCGTTGTT
GATACCCAAA TCGATGCATC AGATTTGCCC TTAAATCTTC CAACCCCAGC GCCGAGTATT
CAAGTTAAAT CAAAATTTCC CAACAAACCG TACCAGTGGT TGGGCTTATT TGGGCTGATC
AGTGTGCTTG GCTGGCCGCT GTTTTCATTA TTTCAAGCAA CTGCGGCATC TCATGATGCT
TCAAGCAGCA TGGCCTTAAT TCCTGCTGAT GAATATTTGC AAGGCAGTAG TGATGCTGAT
ATTGCTGAAT ATACGCGTTT GTGCCAAGTG CATCAGACAG GCTGCGATAG CTCGTGGTTT
GCCGATGAAC AACCACAGCG TTTGATTCAA CTTGATGCCT TTGCCATCGA TCGCTTTGAG
GTCAGCAATC GCGATTTTCT GCGTTATAGT GAGGCTAACC CCAGCCTCTT AACCCAAGCT
GAAACGCAGG GGGCAGGCTT TGTCTGGAGC GATAGTAATG GTTTCGAGTT GATCAATGGA
GCAAATTGGC GGCATCCTCA TGGTCCAAAT TCAGCAATTA CTGAACATTT GGATAAACCA
GTGGTGCAAA TAACGCCCAG TGAAGCCCAA GCCTATTGTA TTTGGCAAGG CAAACGCCTG
CCAACCGAGG CCGAATGGGA GGTAGCAGCC CGTGGTAAGC ATTATTGGCG CTTTCCTTGG
GGCAACGATT GGCAGCCCGC TAAGCTCAAT TTTACCCAAG GCAAGCTTAG CCCAGCCTTG
ATGAACGTAG ATAGCCTACC CGAGGGTCAA AGTTTTTATG GGGTGGCGCA TATGCTTGGC
AATGCTGCTG AATGGACTGC CGATTGGTAT GATCCGCATG CCTGTCAGTC CAACGATCGG
CTTAATCCAC GTGGGCCAGT GATTCCAACT GCCCGGCATA CCCGCCGAGG TGGCTCGGTC
GCTAGCATGG CCGGAGTTTT GCATAGCACC TGGCGGATTA GCAACGAGCA AATTAACGAT
CAACCAAGCA ATGGCACAGG CTTTCGCTGT GTCCAACATA TCGCACTAGA TCAGAGCTTG
CCATGA
 
Protein sequence
MADITNRQQF CRAVGDGLKL VQSQRSGSMP STEAYIADCL SIGVDTLRTL RYASRQRVMV 
EDKTLAALIW IVLAEGRATQ PWLITMLSAT SIIPPEPLTT TWLESYLQSG FKQQLDQALL
SQVVQSILPN QAPSVLNLVV DTQIDASDLP LNLPTPAPSI QVKSKFPNKP YQWLGLFGLI
SVLGWPLFSL FQATAASHDA SSSMALIPAD EYLQGSSDAD IAEYTRLCQV HQTGCDSSWF
ADEQPQRLIQ LDAFAIDRFE VSNRDFLRYS EANPSLLTQA ETQGAGFVWS DSNGFELING
ANWRHPHGPN SAITEHLDKP VVQITPSEAQ AYCIWQGKRL PTEAEWEVAA RGKHYWRFPW
GNDWQPAKLN FTQGKLSPAL MNVDSLPEGQ SFYGVAHMLG NAAEWTADWY DPHACQSNDR
LNPRGPVIPT ARHTRRGGSV ASMAGVLHST WRISNEQIND QPSNGTGFRC VQHIALDQSL
P