Gene EcolC_1615 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcolC_1615 
Symbol 
ID6066608 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli ATCC 8739 
KingdomBacteria 
Replicon accessionNC_010468 
Strand
Start bp1797072 
End bp1798052 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content46% 
IMG OID641601030 
Productlipopolysaccharide biosynthesis protein 
Protein accessionYP_001724600 
Protein GI170019646 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3765] Chain length determinant protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value0.626955 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.791839 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGAGTAG AAAATAATAA TGTTTCTGGG CAAAACCATG ACCCGGAACA GATTGATTTG 
ATTGATTTAC TGGTGCAGTT GTGGCGCGGC AAGATGACTA TCATCATTTC TGTCATTGTG
GCTATTGCCC TGGCTATTGG GTATTTGGCG GTAGCGAAGG AGAAATGGAC GTCAACAGCA
ATTATCACTC AGCCTGACGT GGGGCAAATT GCTGGCTATA ACAATGCCAT GAATGTTATC
TATGGTCAGG CTGCACCTAA AGTGCCGGAT TTACAGGAGG CGTTAATTGG TCGCTTCAGT
TCTGCCTTCT CTGCATTAGC AGAAACGCTG GATAATCAGA AAGAACCAGA AAAACTTACC
ATCGAACCTT CTGTAAAGAA CCAGCAATTA CCATTGACTG TTTCTTATGT TGGGCAAACT
GCAGAGGACG CACAAATGAA ATTGGCCCAA TACATTCAGC AAGTTGATGA TAAAGTAAAT
CAAGAGCTAG AAAAGGATCT CAAGGACAAC CTTGCTTTGG GACGGAAAAA CTTGCAGGAC
TCTTTAAGAA CTCAGGAAGT GGTTGCGCAG GAGCAGAAAG ATCTGCGTAT CCGTCAGATT
CAGGAAGCGT TGCAGTATGC GAATCAGGCG CAGGTGACAA AACCGCAGAT TCAACAGACT
GGCGAAGATA TCACACAAGA TACGTTGTTC CTTTTGGGGA GCGAAGCGCT GGAGTCGATG
ATTAAGCATG AGGCAACTCG TCCGTTGGTG TTTTCGCCAG ACTACTATCA GACTCGTCAA
AACCTGCTGG ATATCGAAAA CTTAAAGGTT GACGATCTTG ATATTCATGC TTACCGTTAT
GTGATGAAAC CGACGTTACC TATTCGTCGC GATAGCCCGA AAAAGGCAAT TACCTTGATT
CTGGCGGTGC TGCTGGGCGG CATGGTTGGC GCGGGGATTG TGTTGGGGCG TAATGCTCTG
CGTAATTACA ACTCGAAGTA A
 
Protein sequence
MRVENNNVSG QNHDPEQIDL IDLLVQLWRG KMTIIISVIV AIALAIGYLA VAKEKWTSTA 
IITQPDVGQI AGYNNAMNVI YGQAAPKVPD LQEALIGRFS SAFSALAETL DNQKEPEKLT
IEPSVKNQQL PLTVSYVGQT AEDAQMKLAQ YIQQVDDKVN QELEKDLKDN LALGRKNLQD
SLRTQEVVAQ EQKDLRIRQI QEALQYANQA QVTKPQIQQT GEDITQDTLF LLGSEALESM
IKHEATRPLV FSPDYYQTRQ NLLDIENLKV DDLDIHAYRY VMKPTLPIRR DSPKKAITLI
LAVLLGGMVG AGIVLGRNAL RNYNSK