Gene EcolC_1471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1471 
Symbol 
ID6067225 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1622892 
End bp1624448 
Gene Length1557 bp 
Protein Length518 aa 
Translation table11 
GC content50% 
IMG OID641600891 
Producthypothetical protein 
Protein accessionYP_001724461 
Protein GI170019507 
COG category[T] Signal transduction mechanisms 
COG ID[COG2200] FOG: EAL domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000861353 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0219942 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCATAC GCGCTCCCAA TTTTGGACGT AAGCTCCTGC TTACCTGCAT TGTTGCAGGC 
GTAATGATTG CGATACTGGT GAGTTGCCTT CAGTTTTTAG TGGCCTGGCA TAAGCACGAA
GTCAAATACG ACACACTGAT TACCGACGTA CAAAAGTATC TCGATACCTA TTTTGCCGAC
CTGAAATCCA CTACTGACCG GCTCCAGCCG CTGACCTTAG ATACCTGCCA GCAAGCTAAC
CCCGAACTGA CCGCCCGCGC AGCGTTTAGC ATGAATGTCC GAACGTTTGT GCTGGTGAAA
GATAAAAAAA CATTCTGTTC ATCTGCGACC GGTGAGATGG ACATTCCACT CAATGAATTG
ATTCCGGCGC TCGACATTAA TAAAAACGTC GATATGGCGA TCTTACCCGG CACGCCGATG
GTGCCGAACA AACCCGCAAT CGTCATCTGG TATCGCAACC CTTTGCTGAA AAATAGCGGC
GTCTTTGCCG CTCTGAATCT CAACCTGACG CCTTCACTCT TTTATAGTTC ACGGCAGGAA
GATTACGATG GCGTCGCCCT CATTATTGGC AATACTGCGC TATCTACCTT TTCTTCACGT
TTGATGAACG TTAACGAATT AACCGACATG CCAGTCCGTG AAACTAAAAT TGCGGGCATT
CCTCTGACCG TTCGGCTTTA TGCAGATGAC TGGACATGGA ACGATGTGTG GTACGCATTT
TTACTGGGCG GCATGAGTGG AACTGTCGTT GGCCTGCTCT GCTATTACCT GATGAGCGTA
CGTATGCGCC CCGGCAGAGA AATCATGACC GCCATCAAGC GCGAACAATT TTACGTGGCG
TATCAACCGG TGGTGGATAC ACAAGCTTTG CGAGTAACGG GCCTGGAAGT ACTGCTACGC
TGGCGGCATC CTGTCGCGGG AGAAATTCCC CCGGATGCCT TCATTAACTT TGCCGAATCG
CAAAAGATGA TTGTGCCGCT GACTCAGCAC CTGTTTGAGT TAATTGCCCG CGATGCCGCA
GAATTAGAAA AAGTGCTGCC GGTAGGCGTC AAATTTGGTA TTAACATTGC GCCGGACCAT
CTGCACAGCG AAAGCTTTAA AGCAGATATC CAGAAACTGC TCACTTCCCT GCCCGCACAC
CATTTCCAGA TTGTGCTGGA AATTACCGAG CGCGATATGT TGAAAGAGCA AGAAGCCACA
CAACTCTTCG CCTGGCTGCA CTCGGTCGGC GTAGAAATTG CTATTGATGA CTTCGGCACC
GGGCACAGCG CGCTTATCTA TCTTGAGCGT TTTACGCTCG ATTATCTGAA AATTGACCGT
GGATTTATCA ACGCCATCGG TACGGAAACG ATCACTTCAC CCGTACTTGA CGCGGTGCTG
ACGCTGGCGA AACGCCTCAA TATGCTGACG GTTGCTGAAG GGGTCGAAAC GCCGGAACAG
GCGCGATGGC TAAGCGAACG CGGCGTTAAT TTCATGCAAG GCTACTGGAT TAGTCGCCCG
TTACCGCTGG ACGATTTTGT TCGCTGGCTG AAGAAACCGT ATACGCCGCA GTGGTAA
 
Protein sequence
MFIRAPNFGR KLLLTCIVAG VMIAILVSCL QFLVAWHKHE VKYDTLITDV QKYLDTYFAD 
LKSTTDRLQP LTLDTCQQAN PELTARAAFS MNVRTFVLVK DKKTFCSSAT GEMDIPLNEL
IPALDINKNV DMAILPGTPM VPNKPAIVIW YRNPLLKNSG VFAALNLNLT PSLFYSSRQE
DYDGVALIIG NTALSTFSSR LMNVNELTDM PVRETKIAGI PLTVRLYADD WTWNDVWYAF
LLGGMSGTVV GLLCYYLMSV RMRPGREIMT AIKREQFYVA YQPVVDTQAL RVTGLEVLLR
WRHPVAGEIP PDAFINFAES QKMIVPLTQH LFELIARDAA ELEKVLPVGV KFGINIAPDH
LHSESFKADI QKLLTSLPAH HFQIVLEITE RDMLKEQEAT QLFAWLHSVG VEIAIDDFGT
GHSALIYLER FTLDYLKIDR GFINAIGTET ITSPVLDAVL TLAKRLNMLT VAEGVETPEQ
ARWLSERGVN FMQGYWISRP LPLDDFVRWL KKPYTPQW