Gene TM1040_0418 details

Gene Information

Locus tag: TM1040_0418
Symbol:
ID: 4076178
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 428354
End bp: 430087
Gene Length: 1734 bp
Protein Length: 577 aa
Translation table: 11
GC content: 59%
IMG OID: 638005713
Product: alpha amylase, catalytic region
Protein accession: YP_612413
Protein GI: 99080259
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG0366] Glycosidases
TIGRFAM ID:
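
The coordinate and length fields in the table above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is consistent with the reported gene length) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 428354
END_BP = 430087
GENE_LENGTH_BP = 1734
PROTEIN_LENGTH_AA = 577


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(START_BP, END_BP)
computed_protein_length = expected_protein_length(computed_gene_length)
print(computed_gene_length, computed_protein_length)  # 1734 577
```

Both computed values agree with the Gene Length and Protein Length fields of the record.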


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.537622
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.197095
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCCACAT CCTTTGCTGC ACGCCTGTCG CAACTGCTGA CCCAAATCTA TCCGGATTTG 
GACACGGAGA TCCTTGGCTC CAAAGTGGTT GAGGCGTTTT GGCCCGAAGG CAGTCACCGG
CGCAAGCGCC CGCGCAGGCC TGGAAATTCT CTTTGGTCAG AACGCGACGC ATTGCTGATC
ACCTATGGCA ACACCATGCG TGATGGGGCG CATAAGCCGT TAGACCTGCT GCATGATTTC
CTGCTGACTT ATATGAAGGG TGTTGTGAAT GGCGTGCATA TCCTGCCGTT TTTTCCCTTC
ACTTCGGACG ACGGGTTTGC CGTCACGGAC TATCGCAAGG TGAATCCTGA GCTGGGTGAT
TGGGCCGACA TTCGCCGGAT CGGCGGCGCG TTTCACCTGA TGTCTGACAT GGTGCTCAAC
CATGTGTCCT CGCAGAGCGC CTGGTTCAAC GCCTACCGGC AGGGGCAGCC GCCTTATGAC
CGCTTCTTCT ACGAGGCTTC GCCCTCGGAC GATCTGAGCG CAGTGGTGCG TCCACGCACA
ACGCCGCTGC TGCAGGAGGT AGAGACAGCC ACGGGCGCGA AACATGTGTG GTGCACCTTC
AGCCACGATC AGGTTGATCT CAATTTTGAG AACCCGGAGG TCCTGCTGGA AATCCTTCGG
ATCATTCGCC TGCATATCGA TCAGGGGGTC CGCATTATCC GGCTCGATGC GGTGGCCTTT
ATATGGAAAG AGGTTGGGAC CAGTTCGATC CACCTGCCGC AAACCCATGC GATTGTACAG
CTGCTGCGCC TGCTGGCAGA TTATGCGACC GAGACGGTGG TACTGCTGAC CGAGACCAAC
GTGCCGCGGG CTGAGAATCT CAGCTACTTT GGCAATCGCA ACGAGGCCCA TGTGGTCTAT
AATTTTCCGC TGCCGCCATT GATCCTGCAC GCGATGATGG CGGGCTCGGC GCGCTACCTG
CTGAATTGGG CACGGGCGAT GCCGCCGGCG CCCCTGGGAT GCGCCTATTT GAATTTCACC
GCGAGCCACG ATGGAATCGG GATGCGCCCG GCGGAGGGGG TGTTGCCGCA GGAGGAAATC
GACCAGATGA TCGCCTGCGT GCGGGCGGTA GGGGGTCTTG TGTCCATGCG GGCCTTGCCG
GGGGGTGGTG AAGCGCCCTA CGAGGTGAAC TGCACCTATT TTGACGCGCT TGGCCGAACC
TTTGACAGGG GCGAAGCGCG AAAGGTGGAT CGATTCATCT GTGCGCAGAC CATTCCCATG
AGCCTTGAGG GAATTCCGGC GTTTTACATT CACGCGATGC TGGCGACGGC CAATGATCAT
GATGCGGTGG CGCGGCGCGG TATGAACCGG GCGATCAACC GCCACCGGTG GGATTACGGC
GAGCTGAAGG CGCGTCTGAA TGACGCGGAC AGCGCGCAGG CTCAGGTGAT GTCGGCGCTC
TCCGAACGGC TGCGGGTCCG GGCCGAGCAG CCGGCGTTTC ACCCCAATGC TACCCAGTTC
ACTTTGCAGC TGGATGATCG TGTCTTTGCG CTCTGGCGGC AGTCGCTGGA CCGGGCGCAG
TCGATCTTTG CGCTGCACAA TGTCAGCGGA GATGGGGTGA TCCTGCATCC CGGCGCGCTG
AACCTTATTG AAGGTGAGAC ATGGCGGGAT CTGTTGTCCG GTGACATGTT TGAAAGCGAT
GCAGAGATCA CACTGGCACC CTATCAATGC CGTTGGATCA CCAATCAGGC TTGA
 
Protein sequence
MATSFAARLS QLLTQIYPDL DTEILGSKVV EAFWPEGSHR RKRPRRPGNS LWSERDALLI 
TYGNTMRDGA HKPLDLLHDF LLTYMKGVVN GVHILPFFPF TSDDGFAVTD YRKVNPELGD
WADIRRIGGA FHLMSDMVLN HVSSQSAWFN AYRQGQPPYD RFFYEASPSD DLSAVVRPRT
TPLLQEVETA TGAKHVWCTF SHDQVDLNFE NPEVLLEILR IIRLHIDQGV RIIRLDAVAF
IWKEVGTSSI HLPQTHAIVQ LLRLLADYAT ETVVLLTETN VPRAENLSYF GNRNEAHVVY
NFPLPPLILH AMMAGSARYL LNWARAMPPA PLGCAYLNFT ASHDGIGMRP AEGVLPQEEI
DQMIACVRAV GGLVSMRALP GGGEAPYEVN CTYFDALGRT FDRGEARKVD RFICAQTIPM
SLEGIPAFYI HAMLATANDH DAVARRGMNR AINRHRWDYG ELKARLNDAD SAQAQVMSAL
SERLRVRAEQ PAFHPNATQF TLQLDDRVFA LWRQSLDRAQ SIFALHNVSG DGVILHPGAL
NLIEGETWRD LLSGDMFESD AEITLAPYQC RWITNQA
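
The sequences above are printed in blocks of 10 characters, so the spaces and line breaks have to be removed before any analysis. The sketch below shows one way to do that, compute a GC fraction, and translate the start of the gene. It assumes the standard codon-to-amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits). The helper names are hypothetical, and only the first three blocks of the gene sequence are used as input.

```python
BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(blocked: str) -> str:
    """Remove the spaces and line breaks used for 10-character blocking."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon.

    Raises KeyError for codons containing characters other than A, C, G, T.
    """
    protein = []
    for pos in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
first_blocks = clean("ATGGCCACAT CCTTTGCTGC ACGCCTGTCG")
print(translate(first_blocks))  # MATSFAARLS
```

The translated prefix matches the first block of the listed protein sequence. Applying `gc_fraction` to the full cleaned gene sequence gives a value to compare against the GC content field.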