Gene Haur_3672 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_3672 
Symbol 
ID5735533 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009972 
Strand
Start bp4616105 
End bp4617241 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content38% 
IMG OID641280821 
Producthypothetical protein 
Protein accessionYP_001546436 
Protein GI159900189 
COG category[R] General function prediction only 
COG ID[COG1106] Predicted ATPases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGCCGAAC TTCAGCTTAA TTCATTAATC ATTCAGAATT TTCGGGGCTT CGAAAATTTT 
CAGATCAACC AACTTGGACG GGTAAACCTG ATTGTTGGTA AAAATAATAT TGGCAAAACT
TCTCTGTTAG AGGCGATTTG GCTGTATGCC AACCGTGGTT CAAGCGTGAC AATCTACGAT
ATTTTGAAAG ATCGCGACGA GCATCGGGTG TTTAATCTGA ACCCTACTAC CGAACAAACC
CAACAAGAAA TTCTTGCCAT CAAAAATTTG TTTTATCAAC GCAATGACTT TACCGACCAG
ACCCAAACGC TGCAAATTGG CAATACTCGC GAGAATTATC TCCAACTTCA GGCGCGTTGG
TATCAAGTTG AGATGGATGA TCACGATAAT TTAACTCCCA AACCAATTAA ATATGCAGAT
CTTGATTTTA GCGATGAGCC ATTTTTTGGT GTCGAAGTAG TAATGTATAG AAATGGTAAA
TCCGCTCAAA AAATTAGAAA ATATCCAATC TATCGCCAAA ATCCTACCCA GAATTGGAAT
GAAATTGTTT GTAACTTTAT TACATCAAAT TTTGTTCATC GCTATCAGCT TAGCAAATGG
CGTGATACAA CCCTGATTGA AGGACTTGAG AATTATGCAC TTGAGGCATT GCAAATTATT
GAACCTTCGA TTGAAGCGAT TAATATGATT ACGGTTGAAG AAAAAGTAAC AACCGATTTC
TCGATAAGCT CAAGGTTGGT TCCCATTCCT GTGGTTAGAA TGGTCGGGGC AACCAAATTT
ATTCCTTTGC GCAGCTTAGG CGATGGTTTG AATCGCATGC TGATTTTGAT TTTAGCAATG
GTCAATGCCA AAGATGGCTT TGTATTAATT GATGAAATTG AAAATGGCCT GCACTATTCA
ATCTATCCTA ATGTTTGGAA ATTGATTTTT AAGCTAGCTG AAACCCTAAA TGTTCAAGTA
TTTGCGACAA CTCATAGCAA AGAATGTTTA AATGCCTTCA ATAAAACCAA TAAAGATCAA
GCCGCTCAAT CAGGGCGATT AATTCGCTTG GGTCGCAAAA AAGGCAACAT CGTTGCAACT
GAATATAATC AAAAAGATAT GCAAGTTATC CTCGAACGTG ATATTGAGGT ACGCTAG
 
Protein sequence
MAELQLNSLI IQNFRGFENF QINQLGRVNL IVGKNNIGKT SLLEAIWLYA NRGSSVTIYD 
ILKDRDEHRV FNLNPTTEQT QQEILAIKNL FYQRNDFTDQ TQTLQIGNTR ENYLQLQARW
YQVEMDDHDN LTPKPIKYAD LDFSDEPFFG VEVVMYRNGK SAQKIRKYPI YRQNPTQNWN
EIVCNFITSN FVHRYQLSKW RDTTLIEGLE NYALEALQII EPSIEAINMI TVEEKVTTDF
SISSRLVPIP VVRMVGATKF IPLRSLGDGL NRMLILILAM VNAKDGFVLI DEIENGLHYS
IYPNVWKLIF KLAETLNVQV FATTHSKECL NAFNKTNKDQ AAQSGRLIRL GRKKGNIVAT
EYNQKDMQVI LERDIEVR