Gene Haur_5215 details

Gene Information

Locus tag: Haur_5215
Symbol:
ID: 5737173
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009973
Strand:
Start bp: 307940
End bp: 310012
Gene Length: 2073 bp
Protein Length: 690 aa
Translation table: 11
GC content: 45%
IMG OID: 641282379
Product: hypothetical protein
Protein accession: YP_001547970
Protein GI: 159901724
COG category: [S] Function unknown
COG ID: [COG3472] Uncharacterized conserved protein
TIGRFAM ID:
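
The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, not part of the database record: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed convention)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Protein length for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length_bp(307940, 310012)
    print(length, "bp")                      # 2073 bp
    print(protein_length_aa(length), "aa")   # 690 aa
```

With the values listed for Haur_5215, the sketch reproduces the reported Gene Length (2073 bp) and Protein Length (690 aa).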


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.741994
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATGATA CCCAAAGCCA AACGGAATCA CTGCTTGATA TTGTTGCTGG TATTAAACAA 
CGCACGATTA TGTTGCCAGA ATTTCAGCGC GATTTTCGCT GGGAATTGCC GCAAACCTAT
GATCTGTTTG ATTCCCTGAT TCGTGACATC TTTATTGGGA CGGTTATTTA TGGGAAACCG
TCGTTTGAAA TGACCTTGCG CGAAATTGAT ACCCGTCCAC GGAAAGGGAA AGGGGCAAAT
GCCCCGTTGG TTACCCACTA TTACTCGGCA CAGCAGATTA CCACAAAAAG CCAAACGCAA
AATCTGCGCA TTGTGTTGGA TGGACAACAG CGCCTCACCT CACTCTATCG TGCCATTACT
GGAGAAGGGA ATGACTCGGT GTTTGTGATT CTCCATGAAG GTATCGATCT GGAAAGTGTG
AGCCAAAAAA GCTTAGAAGA ATTGGTAGAT CGGATTGCTG GTGAGGAAAG TCGGACAGCG
ATTTCGGTCA AACTGTCCTT AGCCTATCTC TCAGAAACCG ATAGCTCGAT TGAACCAGAA
GATCAATACC AATGGTTTGC CGATACTGTG TATGGCCGTG ATGTGATCCA AACAGCAGAT
CTCGACTTCC AGAAAAAGGC TGAACGCGCC TATCGCCGCA TCCTGACGAA ACTGACGGAT
CTCTTCAAAC AACAAAAAAT GGTTGCGTTT TATTTGCTCG ATATGTCGCT TGATAAATTT
TGCTTGTTTT TTGAGCGCAG TAATAGTCGT GGTATCCAGC TCAATTTTAC CGACATTTTG
GCTGCAAAGC TCTATCATGG CTTTAATCTC CGCGCCAAAA TTGAAGAATT TGAGAGCCAA
GCAAAGATTA AGGTGAATCG TGAGATTATT ATTCGGGCGA TTGCCTATAT TGTTGGCTCG
AGTCGTGGCC GAAGCCCTGA GATCGATAAA AAATATATCT TAGAGACGCT CAACGCGGAC
GACTTTAACA CCTATTGGGA TGAGGTCTGC CGTTTATATC GTGAGTGCTT GGGCTATTTA
ACCAATCAAC ACTATATTTT ATCGCAAGAC TGGATGCCCT TCGAGAATAT GGTCATCCCA
CTCATGATGT TCCTCCGTCA GATTAAGAGC TTTGATCGCA TGAACGAGCA ACAGCGGAGT
TTTCTTGAAT TTTGGTATTG GGCATCGATT TTTGCCAATC GCTATAGTTC AGCATCGAAT
GCAACGATTA TCATCGATTG TAAGGTCTTG ATACAGGTGG CGCATGGTGA GCGGATTGAC
AACCGCAGTT ATTTTATGCG TTTGCGGTCA TTAGTAACAG AATCCCTCGA TTTGTTGAGC
TACACAAAAA AGACGAGTTC GATTTACCGA GGGATTCTGA ACCTGATCAA CTATGCAGCA
AAAGGATTGC GCGACTGGAG TAATGCCCAG ATTCTTGATG TGAGTATGCG CCTTGAGGAT
CATCATATTT ATCCACGGGG CTATATTACC AGTAAACCAG AACTGGATAT TGATCACAAT
GAGGCAGAAC AATTAGTTGA TTGTGTCGTG AATCGCACCC TGATTCCGAA AATTAAGAAT
ATCACGATTG GGAAAAAAGC ACCCTATACC TATCTGGCGG ATTTAGCCAC GCAGAATAGT
CAGCTTTCGC TGAGCCTTGC AACCCATTTG CTACCAAAGG ATTTTGATAG TGAACCAACC
TGGAATACGT GCTTTAAGCT CTTTTTGGAA GAACGGGCAC AGGCCCTCTT TGCGCTGATC
GATAAGTATG TGATTGAGCC ATTGCCCGAT ATGATCCATC AGCATGCCGT TGTTGGTGAA
GCGACGGAAC AACCCAGCGA GGGTCGTAAA CCACGATTTA ACGATATGGT TCAAGCAAAA
AAGGTTGTGC CAGGCGATCA GCTTTATACG AAAAAATATC CGCAGCGGCG TGCGACAGTG
GTTGATGGGG AGACCGTCGA GTATGACGGG GTACGCTATC CCATTAATGT GTGGGGTGAA
AAAGTGACGG GATGGTCGTC GATTAATATT TATGATTCCG TGATATTAGA ACGAACGGGG
AAGCCATTAA GAAGTTTACG CGAAGAAGGC TAA
 
Protein sequence
MNDTQSQTES LLDIVAGIKQ RTIMLPEFQR DFRWELPQTY DLFDSLIRDI FIGTVIYGKP 
SFEMTLREID TRPRKGKGAN APLVTHYYSA QQITTKSQTQ NLRIVLDGQQ RLTSLYRAIT
GEGNDSVFVI LHEGIDLESV SQKSLEELVD RIAGEESRTA ISVKLSLAYL SETDSSIEPE
DQYQWFADTV YGRDVIQTAD LDFQKKAERA YRRILTKLTD LFKQQKMVAF YLLDMSLDKF
CLFFERSNSR GIQLNFTDIL AAKLYHGFNL RAKIEEFESQ AKIKVNREII IRAIAYIVGS
SRGRSPEIDK KYILETLNAD DFNTYWDEVC RLYRECLGYL TNQHYILSQD WMPFENMVIP
LMMFLRQIKS FDRMNEQQRS FLEFWYWASI FANRYSSASN ATIIIDCKVL IQVAHGERID
NRSYFMRLRS LVTESLDLLS YTKKTSSIYR GILNLINYAA KGLRDWSNAQ ILDVSMRLED
HHIYPRGYIT SKPELDIDHN EAEQLVDCVV NRTLIPKIKN ITIGKKAPYT YLADLATQNS
QLSLSLATHL LPKDFDSEPT WNTCFKLFLE ERAQALFALI DKYVIEPLPD MIHQHAVVGE
ATEQPSEGRK PRFNDMVQAK KVVPGDQLYT KKYPQRRATV VDGETVEYDG VRYPINVWGE
KVTGWSSINI YDSVILERTG KPLRSLREEG
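
The gene and protein sequences above are printed in space-separated blocks of ten characters, so they need to be cleaned before any analysis. The sketch below is one way to do that and to relate the two sequences; it is an illustration under stated assumptions, not the database's own tooling. The helper names (`clean`, `translate`, `gc_fraction`) are hypothetical. The codon table is the standard amino acid assignment, which translation table 11 shares for all internal codons (table 11 differs only in which codons may act as start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; translation table 11 uses the
# same amino acid assignments and differs only in permitted start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(block: str) -> str:
    """Remove the grouping spaces and line breaks used in the listing."""
    return "".join(block.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence listing, up to the first 30 bases.
    print(translate("ATGAATGATA CCCAAAGCCA AACGGAATCA"))  # MNDTQSQTES
```

Pasting the full gene sequence into `translate` should yield the protein sequence listed above once both are passed through `clean`, and `gc_fraction` can be compared with the GC content field in the Gene Information section.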