Gene EcolC_1474 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1474 
Symbol 
ID6067208 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1626359 
End bp1627345 
Gene Length987 bp 
Protein Length328 aa 
Translation table11 
GC content54% 
IMG OID641600894 
Productcobalamin synthesis protein P47K 
Protein accessionYP_001724464 
Protein GI170019510 
COG category[R] General function prediction only 
COG ID[COG0523] Putative GTPases (G3E family) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00100223 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00332333 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
GTGACCAGGA CCAACCTCAT CACCGGTTTT CTCGGCAGCG GGAAAACCAC GTCGATTCTT 
CATCTGTTAG CCCATAAAGA TCCCAACGAA AAATGGGCGG TACTGGTTAA TGAATTTGGG
GAAGTCGGAA TTGATGGTGC TTTGCTCGCC GATAGCGGCG CATTGCTGAA AGAGATCCCC
GGCGGCTGCA TGTGCTGCGT TAATGGTTTA CCCATGCAGG TAGGGTTGAA TACCTTACTG
CGTCAGGGAA AACCAGACCG CTTGTTGATA GAGCCGACCG GGCTGGGCCA TCCGAAACAG
ATCCTCGATC TGTTAACCGC ACCAGTCTAT GAACCGTGGA TAGATCTGCG CGCCACCTTG
TGCATTCTCG ATCCACGCCT GCTGCTGGAC GAAAAAAGCG CCAGCAATGA AAACTTCCGT
GACCAGCTGG CTGCCGCAGA CATCATTGTC GCCAATAAAT CCGACCGTGC GACGCCCGAA
AGTGAGCAAG CGCTACAGCG TTGGTGGCAG CAAAATGGTG GCGATCGACA ATTAATTCAC
AGTGAGCATG GGAAAGTTGA CGGTCATCTT CTGGATTTGC CGCGTCGCAA TTTAGCCGAG
TTGCCCGCCA GCGCCGCGCA TTCTCATCAG CATGTCGTGA AAAAAGGGTT AGCAGCGTTA
AGCCTGCCAG AGCATCAACG CTGGCGTCGC AGTCTGAACA GCGGGCAAGG ATATCAGGCC
TGCGGCTGGA TATTCGACGC TGATACGGTA TTCGACACCA TTGGCATTCT GGAATGGGCG
CGACTTGCAC CGGTGGAACG CGTCAAAGGC GTGCTGCGTA TTCCCGAAGG GCTGGTACGA
ATCAACCGTC AGGGCGATGA CCTGCACATT GAAACGCAAA ACGTTGCGCC ACCGGACAGC
CGTATTGAGC TGATTTCCAG CAGCGAAGCT GACTGGAATG CCTTACAGAG CGCGCTGTTG
AAGCTTCGTT TAGCGACTAC CGCGTAA
 
Protein sequence
MTRTNLITGF LGSGKTTSIL HLLAHKDPNE KWAVLVNEFG EVGIDGALLA DSGALLKEIP 
GGCMCCVNGL PMQVGLNTLL RQGKPDRLLI EPTGLGHPKQ ILDLLTAPVY EPWIDLRATL
CILDPRLLLD EKSASNENFR DQLAAADIIV ANKSDRATPE SEQALQRWWQ QNGGDRQLIH
SEHGKVDGHL LDLPRRNLAE LPASAAHSHQ HVVKKGLAAL SLPEHQRWRR SLNSGQGYQA
CGWIFDADTV FDTIGILEWA RLAPVERVKG VLRIPEGLVR INRQGDDLHI ETQNVAPPDS
RIELISSSEA DWNALQSALL KLRLATTA