Gene Haur_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3809 
Symbol 
ID5735673 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4781061 
End bp4782161 
Gene Length1101 bp 
Protein Length366 aa 
Translation table11 
GC content54% 
IMG OID641280961 
Producthypothetical protein 
Protein accessionYP_001546573 
Protein GI159900326 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000111673 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTCCGAC GAGCAGCCAT CCATCGCTGG TTATTGGTTT TAGTCGCGCT ATTTGTGGCC 
ATGCCAGTTG CGGCGCAAGC CCCGACCGAA GGCGCAAAAG CTGGCGCGTG GATTGTCACT
CAGGCTCAGG CCGATGGCAG TTTTCCTGGC TTTGGGTTGG GCGAAACCGC CGATGCCGTC
TATGCCTTGA AGGCAACTGG CTTGAAAGTT GATCTCAACG TTCAAAGCTT TATCGAAAAA
AATGCTAGTG CAATCGCCGC TAAGCCTGGG GTTGCCGCCA AGTTTGTGTT GGCCGAATTG
TTGCTGGGCT ACAATCCTCG CGCCGTTGCC GGAACCGATT TGGTCGCGGC TGTAACTGGC
AGCTACAAAG CCGATAGTGG TATGTATGGT GGCGATGTCA CGACCCATGC CTTGGCTTTG
TTGGCCTTGA ATGCTGCTGG CGCACCAGTC GAAAACAAAG CGATCAACAC CTTGAACTCG
GTCCAAATCG CCGATGGATC ATGGTCGTTC AGCGGCGATA CAACTGCTGG CGCTGGCGAT
ACCAATACCA CCGCTTTGGT AGTACAAGCC TTGGTTGCGA TTGGTCAAGG TAAGAGCGAA
GCTGTTACCA AGGCGCTGAG CTACTTGCAA AGCCAACAAA ATAGTGATGG TGGTTTTCCC
TATTCGAAGG CTTCGAGCTA TGGCAGCGCC ACCGATGCCA ACTCAACCGG CTTGGTAATT
CAAGCGATTG TGGCAACTGG CGGTAATCCA ACCGCTGCAC CTTGGGCGAC GGCGACTGGC
AATCCATTGA GTGCCTTGTT GAGCTTGCAA AATGCCAGCG GTGCGTTCCG CTACGATGCA
GCAACGCCCG ATGATAATGC GTTTGCGACA TATCAAGCTA CGCCTGCTTT GTTCTATGTG
ACCTATCCTT TGACCGCCTT GGTGACAGCA CCCCAACCAA CTGCTGTGCC AAGCACACCT
GTGGCAACCG CTACCCCCAA ACCAAACACC CCAATTACCT TGCCCGACAC TGGCGCACCT
GCATTGCCAT TATGGCCAGT GGTGATTGTG TTTGGCTTGG CTTGTATTGT GGCTGGTTTA
CGTTTGCGCC GCGTCGCTTA A
 
Protein sequence
MLRRAAIHRW LLVLVALFVA MPVAAQAPTE GAKAGAWIVT QAQADGSFPG FGLGETADAV 
YALKATGLKV DLNVQSFIEK NASAIAAKPG VAAKFVLAEL LLGYNPRAVA GTDLVAAVTG
SYKADSGMYG GDVTTHALAL LALNAAGAPV ENKAINTLNS VQIADGSWSF SGDTTAGAGD
TNTTALVVQA LVAIGQGKSE AVTKALSYLQ SQQNSDGGFP YSKASSYGSA TDANSTGLVI
QAIVATGGNP TAAPWATATG NPLSALLSLQ NASGAFRYDA ATPDDNAFAT YQATPALFYV
TYPLTALVTA PQPTAVPSTP VATATPKPNT PITLPDTGAP ALPLWPVVIV FGLACIVAGL
RLRRVA