Gene Haur_4200 details


Gene Information

Locus tag: Haur_4200
Symbol:
ID: 5736062
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Herpetosiphon aurantiacus ATCC 23779
Kingdom: Bacteria
Replicon accession: NC_009972
Strand:
Start bp: 5355563
End bp: 5356573
Gene Length: 1011 bp
Protein Length: 336 aa
Translation table: 11
GC content: 50%
IMG OID: 641281355
Product: cobalamin synthesis protein P47K
Protein accession: YP_001546960
Protein GI: 159900713
COG category: [R] General function prediction only
COG ID: [COG0523] Putative GTPases (G3E family)
TIGRFAM ID:
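
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes a single terminal stop codon. The function names are hypothetical and not part of any database tooling.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose coordinates are 1-based and inclusive (assumed)."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count for a complete CDS, assuming one trailing stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Values copied from the record for Haur_4200.
length = gene_length_bp(5355563, 5356573)
print(length)                     # 1011
print(protein_length_aa(length))  # 336
```

Both results agree with the Gene Length and Protein Length fields of the record.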


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.000951795
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACCACGC CGATTCCAAT GACGATTTTG ACCGGATTTT TGGGCGCAGG CAAAACGACG 
TTGCTCAATC GGCTGCTTAG TGCCCAGCAT GGCCTCAAAA TCGCGGTGCT GGTTAACGAT
TTTGGCGAGA TTAATATTGA CTCGCAGTTG GTAGTTGGGG TTGAAAATGA CGCTGTAATT
AATTTGGCGA ATGGCTGTAT TTGCTGCACC ATTCGCGAGG ATTTACTCAC CACCACCCTC
GAATTGCTTG AGCGCGATGA TCGACCTGAA TATATCATCG TCGAAGCCAG CGGCGTTTCC
GACCCGGTTT CGGTGGCATT AACCTTCCGT TTGCCCGCCC TGCGCTCGTT GATCAATCTC
GATTCGATTG TGGCGGTCGT TGATGCTGAG AGTATTCACC AACAACGTGA ACAATTGATT
CAAGTCGTCG ATCAAATTGC CGCCGCCGAT CTTGTTGTGA TCAATAAAAT CGATTTGGTT
GATGCTGCCC AACAGCAACG GGTGATTGCC TGGATTCAGA CGATTGTGCC ACGGGCGCGA
ATTTTGACCG CTGAATATGG CGAGGTTCCG GTTGATTTGC TGCTGGGAGT TGGTCAATAT
CGCATCGATT TGCAGGCTGA AGCGTATCCC ACCCAGCATC AACATAACGA AGAATGGCAA
ACCTGGAATT ACCAAACTGA TCAGCCTTTT ACCATGAGCA GCCTGCAACG AGCCTTCCAA
CAATTGCCAA CCGCGATTTT TCGTGCTAAA GGCATTGTAT ATTTGGCCGA AGCACCTGAA
CGCCGCGCAA TTGTTCAGTT GGCGGGCAAA CGTACTAGTT TGCGGCTCAG TGAGCCATGG
GGCGCAGCCA CTCCGTACAG CCAAATTGTG ATAATTGGCC GGAGCAATAG CTTTGATCCA
GCTGAATTGA CCCACCATTT TAATGCTTGT TTGGCAGATG CCACGCAAGA ACCACGCGAA
GAAATTCTGA CTGTGGCCGA ATGGCGACGC AAATACCAAG CCCAATCGTA A
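
The gene sequence is displayed in blocks of 10 bases, so the whitespace has to be removed before any computation. What follows is a minimal sketch of that cleanup, together with a GC-fraction helper that could be used to check the reported GC content. Both function names are hypothetical, and only the first display line of the gene is used as input here.

```python
def normalize_sequence(raw: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in an already normalized sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First display line of the gene sequence, copied from the record.
first_line = "ATGACCACGC CGATTCCAAT GACGATTTTG ACCGGATTTT TGGGCGCAGG CAAAACGACG "
bases = normalize_sequence(first_line)
print(len(bases))                    # 60
print(f"{gc_fraction(bases):.2f}")   # GC fraction of this line only
```

Running the same two calls over the full gene sequence gives values to compare against the Gene Length and GC content fields. The GC content field is presumably rounded to a whole percent.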
 
Protein sequence
MTTPIPMTIL TGFLGAGKTT LLNRLLSAQH GLKIAVLVND FGEINIDSQL VVGVENDAVI 
NLANGCICCT IREDLLTTTL ELLERDDRPE YIIVEASGVS DPVSVALTFR LPALRSLINL
DSIVAVVDAE SIHQQREQLI QVVDQIAAAD LVVINKIDLV DAAQQQRVIA WIQTIVPRAR
ILTAEYGEVP VDLLLGVGQY RIDLQAEAYP TQHQHNEEWQ TWNYQTDQPF TMSSLQRAFQ
QLPTAIFRAK GIVYLAEAPE RRAIVQLAGK RTSLRLSEPW GAATPYSQIV IIGRSNSFDP
AELTHHFNAC LADATQEPRE EILTVAEWRR KYQAQS
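
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below illustrates that translation. It assumes table 11 assigns the same amino acids to codons as the standard code (the two differ in which start codons are permitted), and it builds the codon table from the conventional TCAG ordering. The `translate` helper is hypothetical and handles only unambiguous A/C/G/T input.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons TTT, TTC, TTA, TTG, TCT, ... in TCAG order; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # tolerate the 10-base display blocks
    residues = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 12 bases of the gene sequence, copied from the record.
print(translate("ATGACCACGC CGATTCCAAT GACGATTTTG"))  # MTTPIPMTIL
print(translate("GCCCAATCGTAA"))                      # AQS
```

The two outputs match the first ten and last three residues of the protein sequence shown above.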