Gene EcolC_2629 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_2629 
Symbol 
ID6064665 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp2880391 
End bp2881581 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content52% 
IMG OID641602036 
Producthypothetical protein 
Protein accessionYP_001725586 
Protein GI170020632 
COG category[R] General function prediction only 
COG ID[COG1092] Predicted SAM-dependent methyltransferases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.126778 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000254452 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAGTGTAC GTTTAGTGTT AGCCAAAGGG CGCGAAAAAT CATTACTTCG TCGCCATCCG 
TGGGTCTTTT CCGGGGCCGT TGCCCGCATG GAAGGTAAAG CCAGCCTCGG TGAAACCATC
GATATTGTTG ATCATCAGGG AAAATGGTTA GCACGCGGCG CTTATTCGCC AGCTTCGCAA
ATCCGGGCGC GCGTCTGGAC GTTTGACCCG TCTGAGTCTA TCGACATTGC TTTTTTTTCC
CGCCGTTTGC AACAAGCACA AAAATGGCGT GACTGGCTGG CGCAAAAAGA TGGCCTCGAC
AGCTATCGTT TAATCGCCGG AGAATCTGAT GGCCTGCCGG GTATTACTAT CGATCGTTTC
GGTAATTTTC TGGTGCTGCA ACTGCTGAGT GCTGGGGCAG AATATCAGCG CGCGGCATTA
ATTAGTGCCC TGCAAACGCT GTACCCGGAA TGTTCGATTT ACGATCGCAG CGACGTCGCG
GTACGTAAAA AAGAAGGAAT GGAGCTGACC CAGGGCCCCG TCACCGGCGA GTTGCCACCT
GCCCTGCTGC CGATTGAAGA ACACGGAATG AAACTGCTGG TGGATATTCA GCACGGACAC
AAAACGGGCT ACTACCTGGA CCAGCGTGAT AGCCGCCTGG CTACCCGCCG CTACGTTGAA
AATAAACGTG TGCTGAACTG TTTCTCCTAT ACCGGTGGTT TCGCCGTATC GGCACTGATG
GGCGGTTGCA GCCAGGTTGT CAGCGTTGAT ACCTCCCAGG AAGCGCTGGA TATTGCACGG
CAGAACGTTG AGCTGAACAA ACTGGATCTG AGCAAGGCTG AGTTTGTCCG TGATGATGTC
TTTAAATTGC TGCGTACTTA TCGCGATCGC GGTGAAAAAT TTGACGTTAT CGTGATGGAC
CCGCCGAAGT TTGTTGAGAA TAAAAGCCAG TTGATGGGCG CGTGTCGGGG CTATAAAGAT
ATCAACATGC TGGCGATTCA GTTGCTGAAT GAAGGCGGTA TTCTCCTGAC TTTCTCCTGT
TCCGGTCTGA TGACCAGCGA TTTATTTCAG AAAATCATCG CGGATGCCGC AATTGATGCC
GGCCGTGATG TACAATTTAT AGAGCAGTTC CGTCAGGCAG CCGATCATCC GGTGATCGCT
ACCTACCCGG AAGGGCTATA TCTGAAAGGG TTTGCCTGTC GCGTCATGTA A
 
Protein sequence
MSVRLVLAKG REKSLLRRHP WVFSGAVARM EGKASLGETI DIVDHQGKWL ARGAYSPASQ 
IRARVWTFDP SESIDIAFFS RRLQQAQKWR DWLAQKDGLD SYRLIAGESD GLPGITIDRF
GNFLVLQLLS AGAEYQRAAL ISALQTLYPE CSIYDRSDVA VRKKEGMELT QGPVTGELPP
ALLPIEEHGM KLLVDIQHGH KTGYYLDQRD SRLATRRYVE NKRVLNCFSY TGGFAVSALM
GGCSQVVSVD TSQEALDIAR QNVELNKLDL SKAEFVRDDV FKLLRTYRDR GEKFDVIVMD
PPKFVENKSQ LMGACRGYKD INMLAIQLLN EGGILLTFSC SGLMTSDLFQ KIIADAAIDA
GRDVQFIEQF RQAADHPVIA TYPEGLYLKG FACRVM