Gene EcSMS35_4030 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagEcSMS35_4030 
Symbolade 
ID6142999 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameEscherichia coli SMS-3-5 
KingdomBacteria 
Replicon accessionNC_010498 
Strand
Start bp4117402 
End bp4119168 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content54% 
IMG OID641618855 
Productcryptic adenine deaminase 
Protein accessionYP_001745993 
Protein GI170680087 
COG category[F] Nucleotide transport and metabolism 
COG ID[COG1001] Adenine deaminase 
TIGRFAM ID[TIGR01178] adenine deaminase 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value0.377933 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATT CTATTAACCA TAAATTTCAT CACATTAGCC GGGCTGAATA CCAGGAATTG 
TTAGCCGTTT CCCGTGGCGA CGCTGTTGCC GATTATATTA TTGATAATGT CTCTATTCTC
GACCTGATCA ATGGCGGAGA AATTTCCGGC CCAATTGTGA TTAAAGGACG TTACATTGCT
GGTGTTGGCG CAGAATACAC TGATGCTCCG GCTTTGCAGC GGATTGATGC CCGCGGCGCA
ACGGCGGTGC CAGGGTTTAT TGATGCTCAC CTGCATATTG AATCCAGCAT GATGACGCCG
GTCACCTTTG AAACCGCTAC CCTGCCGCGT GGCCTGACGA CCGTTATTTG CGACCCTCAT
GAAATCGTCA ACGTGATGGG CGAAGCCGGA TTCGCCTGGT TTGCCCGCTG TGCCGAACAG
GCAAGACAAA ACCAGTACTT ACAGGTCAGC TCTTGCGTAC CCGCCCTGGA AGGCTGCGAT
GTTAACGGTG CCAGTTTTAC CCTTGAACAG ATGCTCGCCT GGCGGGACCA TCCGCAGGTT
ACCGGCCTTG CAGAAATGAT GGACTACCCT GGCGTAATTA GCGGGCAGAA TGCGCTGCTC
GATAAACTGG ATGCATTTCG CCACCTGACG CTGGACGGTC ACTGCCCGGG TTTGGGTGGT
AAAGAACTTA ACGCCTATAT TGCTGCGGGT ATTGAAAACT GCCACGAAAG TTATCAGCTG
GAAGAAGGAC GCCGGAAATT ACAACTCGGC ATGTCGTTGA TGATCCGCGA AGGGTCCGCT
GCCCGCAATC TCAACGCACT GGCAACGTTG ATCAACGAAT TTAACAGCCC GCAATGCATG
CTCTGTACTG ATGACCGTAA CCCGTGGGAG ATCGCCCATG AAGGACACAT CGATGCCTTA
ATTCGCCGCC TGATCGAACA ACACAATGTG CCGCTGCATG TGGCATATCG CGTCGCCAGC
TGGTCGACGG CGCGCCACTT TGGTCTGAAT CACCTCGGCT TACTGGCACC CGGTAAGCAG
GCCGATATCG TCCTGTTGAG CGATGCGCGT AAGGTCACGG TGCAGCAGGT ACTGGTGAAA
GGCGAGCCGA TCGATGCACA AACCTTACAG GCGGAAGAGT CGGCGAGACT GGCACAATCC
GCCCCGCCAT ATGGCAATAC CATTGATCGC CAGCCAGTTT CCGCCAGTGA CTTTGCCCTG
CAATTTACCC CCGGAAAACG CTATCGCGTT ATTGAGGCCA TCCATAACGA ATTGATTACC
CACTCCCGCT CCAGCGTCTA CAGCGAAAAT GGTTTTGATC GCGATGATGT GTGCTTTATT
GCCGTACTTG AGCGTTACGG GCAACGGCTG GCTCCGGCCT GTGGTTTGCT CGGCGGCTTT
GGCCTGAATG AAGGTGCGCT GGCGGCGACG GTCAGCCATG ACAGCCATAA TATTGTGGTG
ATCGGTCGTA GCGCAGAAGA GATGGCGCTG GCGGTCAATC AGGTGATTCA GGATGGCGGC
GGGCTGTGCG TGGTCCGTAA CGGTCAGGTA CAAAGTCATC TACCGTTGCC CATTGCCGGG
CTAATGAGCA CCGACACGGC GCAGTCACTG GCGGAGCAAA TTGACGCCTT GAAAGCCGCC
GCCCGTGAAT GCGGTCCGTT ACCCGATGAG CCGTTTATTC AGATGGCGTT TCTTTCTCTA
CCAGTGATCC CCGCGCTGAA ACTAACCAGT CAGGGGCTGT TTGATGGCGA GAAGTTTGCC
TTCACTACGC TGGAAGTCAC GGAATAA
 
Protein sequence
MNNSINHKFH HISRAEYQEL LAVSRGDAVA DYIIDNVSIL DLINGGEISG PIVIKGRYIA 
GVGAEYTDAP ALQRIDARGA TAVPGFIDAH LHIESSMMTP VTFETATLPR GLTTVICDPH
EIVNVMGEAG FAWFARCAEQ ARQNQYLQVS SCVPALEGCD VNGASFTLEQ MLAWRDHPQV
TGLAEMMDYP GVISGQNALL DKLDAFRHLT LDGHCPGLGG KELNAYIAAG IENCHESYQL
EEGRRKLQLG MSLMIREGSA ARNLNALATL INEFNSPQCM LCTDDRNPWE IAHEGHIDAL
IRRLIEQHNV PLHVAYRVAS WSTARHFGLN HLGLLAPGKQ ADIVLLSDAR KVTVQQVLVK
GEPIDAQTLQ AEESARLAQS APPYGNTIDR QPVSASDFAL QFTPGKRYRV IEAIHNELIT
HSRSSVYSEN GFDRDDVCFI AVLERYGQRL APACGLLGGF GLNEGALAAT VSHDSHNIVV
IGRSAEEMAL AVNQVIQDGG GLCVVRNGQV QSHLPLPIAG LMSTDTAQSL AEQIDALKAA
ARECGPLPDE PFIQMAFLSL PVIPALKLTS QGLFDGEKFA FTTLEVTE