Gene Haur_5152 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagHaur_5152 
Symbol 
ID5737110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameHerpetosiphon aurantiacus ATCC 23779 
KingdomBacteria 
Replicon accessionNC_009973 
Strand
Start bp220882 
End bp222030 
Gene Length1149 bp 
Protein Length382 aa 
Translation table11 
GC content60% 
IMG OID641282317 
Producthypothetical protein 
Protein accessionYP_001547908 
Protein GI159901662 
COG category[R] General function prediction only 
COG ID[COG3889] Predicted solute binding protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value0.698253 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGAACACG ATGACGGGTC GTCAGTGCGG AAACGAGGGC GGGTGCGAAC TGGAGTCTGG 
GTGGGGGTGG GCATGCTTGG GCTGGTGGCG TGGTTGCTCG CCCTGCCAGC GGTCTATCTA
GCGAGGGCCG CAACCGATCC GCCGCTGCGT CCGCTGCATG GGGGCGCGCA TCCTGGGGCG
GTATGGCCCC TCGGGTCGGT CATGCCCTCG CCAACGGGGA CGGACGGAGC CGGATCAGTC
CAGTCCGGAA TCCAATCAAC CCTATGGGAG ACCGCCTACC ATAGTGCCCT TGGGGTCGAT
CCGCGCGTGG AACTCGTCCC TGGGCAAACC TATCCCTTTG TGCATGCGGC GCATGGGGTG
ATCGACCAAA CGACCGGGGT GACGATTTCC GTGATCAACA CTGAGTCATC CTTCCTGGGA
ATAGCGCTGT ACTATGGCAC GATGTCGGGG AGTCAGGCTG GTGAAGCCAC GCCTTGCCCG
ACCAATTACC CGGTGTTTTT CAGTGGAAGC TTTCGCGGTC ACCCCCCGCA ATCCGAAGCG
TATATCATCT CGAAGTGCAC GATTCCGCCC CATCCAAGCG GGACGCGGGT CTGGTATCGC
CTGGTCCTTG TGAAGCCTGG TATCTACCGC TACTTGCAGG CAGATGCTAG TGATGGGTGT
GAAGAACTGA CCCCGCTCGG CCAATGTATC CAAGAATCAT TCTTTTTTCC GCGGATTGGC
GTAGGAAATT ACTTCTACGA TGTCGGCTAT GCCAGCCCAA CGCCAACGGA GACCCCGACA
CCAACCAACA CGCCAACGGA GACCCCGACA CCGACCAACA CACCAACCAA CACGCCAACG
GAGACCCCGA CGAATACACC GACCAACACG CCAACCAACA CGCCAACGGA GACATCGACA
CCGACCAACA CGCCAACCAA CACGCCAACG GAGACCCCGA CGAATACACT GACCAACACG
CCAACCAACA CGCCAACGGA GACCCCGACG AATACACCGA CCAACACGCC AACCAACACG
CCAACGGAGA CATCGACACC GACCAACACG CCAACCAACA CGCCAACCAA CACACCGACC
AACACACCAA CAGCCACCAT TGTTCTTCCG CCGCGCTACA CCATGTTTTT GCCGTGGGCG
CAGAAATAA
 
Protein sequence
MEHDDGSSVR KRGRVRTGVW VGVGMLGLVA WLLALPAVYL ARAATDPPLR PLHGGAHPGA 
VWPLGSVMPS PTGTDGAGSV QSGIQSTLWE TAYHSALGVD PRVELVPGQT YPFVHAAHGV
IDQTTGVTIS VINTESSFLG IALYYGTMSG SQAGEATPCP TNYPVFFSGS FRGHPPQSEA
YIISKCTIPP HPSGTRVWYR LVLVKPGIYR YLQADASDGC EELTPLGQCI QESFFFPRIG
VGNYFYDVGY ASPTPTETPT PTNTPTETPT PTNTPTNTPT ETPTNTPTNT PTNTPTETST
PTNTPTNTPT ETPTNTLTNT PTNTPTETPT NTPTNTPTNT PTETSTPTNT PTNTPTNTPT
NTPTATIVLP PRYTMFLPWA QK