Gene Shewmr4_2552 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagShewmr4_2552 
Symbol 
ID4253123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameShewanella sp. MR-4 
KingdomBacteria 
Replicon accessionNC_008321 
Strand
Start bp3036976 
End bp3038952 
Gene Length1977 bp 
Protein Length658 aa 
Translation table11 
GC content50% 
IMG OID638119187 
Productcholine/carnitine/betaine transporter 
Protein accessionYP_734680 
Protein GI113970887 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1292] Choline-glycine betaine transporter 
TIGRFAM ID[TIGR00842] choline/carnitine/betaine transport 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000125111 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones57 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCTATTA AGTCGAGTAT AAATCCACCG GTTTTTTATT CGTCGGTCTT TTTTATCACA 
TTAATGGTGA TGATCTGCGC CATTTGGCCG ACAGAGGCCA ATCACTTTTT TAAATCCATG
CAGTCGTGGT TAGAAGCAAA GGCGGGTTGG TTATATATCC TTGGCGTGGC GATTTTTCTA
ATTTTTATCA TCTTTGTCAT GGTCAGCCGC TTTGGTGATA TTAAGCTCGG CCCCGACCAT
GCGGTACCCG ACTATAGCTA TAAGAGCTGG ATTTCCATGC TGTTTTCGGC GGGGATGGGG
ATTGGGTTGA TGTTCTTTGG CGTCGCCGAA CCCGTGATGC ATTACTTGGC GCCGCCCGAT
GCAACGCCTG AGACCTTGGC CGCCGCCAAA GAGGCCATGA AGATCACTTT CTTCCATTGG
GGCATACATG CGTGGGCGAT TTACGGGGTG GTTGCCCTGA GTTTGGCCTA TTTTGCCTAT
CGGCATAAGT TGCCGCTCTT GCCCCGCAGC GCACTCTATC CCTTAATCGG TGAGCGCATC
CATGGCCCGA TTGGCCACTG CGTCGATACC TTTGCGGTAT TGGGGACTAT GTTCGGGGTG
GCGACCTCCC TAGGGTTTGG GGTGTTGCAG GTTAACTCGG GCTTAAGCTA TTTATTCGAG
CAACTGCCGA ATAACACCAC AGTGCAAGTA TCGCTGATTA TTGGCATTAC CCTACTCGCG
ACTCTGTCAG TATTTTCGGG CCTAGATAAA GGGGTGAAGC GCTTAAGTGA GCTTAACTTA
GGTTTAGCTC TTATTCTGCT GCTGATTGTG TTAATTCTTG GCCCAACCGT GATGTTGCTG
CAAGCCTTCG TACAAAATAC CGGTAGTTAT CTGAGTGATA TTGTTAATAA AACCTTTAAT
CTGTACGCCT ATCAGCATAA GGAAGACTGG CTGGGCGGTT GGACGCTACT CTACTGGGGT
TGGTGGATCT CCTGGTCACC TTTTGTCGGC ACCTTTATTG CGCGGGTGAG TCGTGGCCGC
ACCATACGGG AGTTTTTAGT CGGCATTTTA TTTGTGCCCT CGGCGCTGAC CTTTTTGTGG
ATGACGGTAT TTGGTAACTC GGCTATCGAC GCCATTATGA ATCAAGGCGC GACCTACCTG
AGCGATGCAG TGAATACCAA TGTGCCCGTG GCCTTGTTTG TGTTTTTTGA ACATATGCCG
TTCCCGCATC TCCTGTCGGG GATTGGGATT TGTTTAGTGG TCACCTTTTT TGTTACCTCA
TCGGATTCTG GATCGCTTGT AATCGACAAC CTAACCTCGG GAGGCGATAA CAATGCGCCC
GTCTGGCAAC GGGTGTTTTG GGCACTATTA CAAGGGGTAG TGGCCTCGGT GCTGCTGCTT
GCCGGCGGCT TACAGGCCTT ACAAACTGCG GCCATTGCCA GTGCCATGCC ATTTTTATGT
GTGATGCTGC TGATGTGTTT GGGGCTGTAT AAGGCGCTGA AAGACGATTG GCTTAAGATC
AACAGTGTGC AGATGCATAC CACCAGTGTG CAATACGCTA AGACCAATAT CAGTTGGGAG
GAGCGTATCG AAGTCTTAGT GTCCCATCCT ACCGAAGATG AGGCGCGAAT TTTCCTCAAT
AATGTCGCGA CACCGGCGTT ATCTAAGGTG TGTCAGCAAT TTATGACCAA GGGCCTGACG
GCGGATCTCG AGTATCTCGA CGGCAAAGTG CGATTGGTGA TCAGCAACGA AGCCTATCAA
CCCTTCGTAT ACGGTGTGCG GATGCGCTGT TTTGAAATCG TCAATCCGGT TGGTGAAGAG
TTAGAGCAGG GCAACAATTG GTACTATCGC GCCGAAGTGT ACTTAGAGCA GGGCGGCCAA
CACTATGATG TGATGGGCTA TACCGAAGAG CAGCTATTGG CTGACGTGGT TACCCAATAC
GAGAAATACC TGCACTATTT GCACTTGTCC AATGCCGACC AAGGACATGT AACCTAA
 
Protein sequence
MSIKSSINPP VFYSSVFFIT LMVMICAIWP TEANHFFKSM QSWLEAKAGW LYILGVAIFL 
IFIIFVMVSR FGDIKLGPDH AVPDYSYKSW ISMLFSAGMG IGLMFFGVAE PVMHYLAPPD
ATPETLAAAK EAMKITFFHW GIHAWAIYGV VALSLAYFAY RHKLPLLPRS ALYPLIGERI
HGPIGHCVDT FAVLGTMFGV ATSLGFGVLQ VNSGLSYLFE QLPNNTTVQV SLIIGITLLA
TLSVFSGLDK GVKRLSELNL GLALILLLIV LILGPTVMLL QAFVQNTGSY LSDIVNKTFN
LYAYQHKEDW LGGWTLLYWG WWISWSPFVG TFIARVSRGR TIREFLVGIL FVPSALTFLW
MTVFGNSAID AIMNQGATYL SDAVNTNVPV ALFVFFEHMP FPHLLSGIGI CLVVTFFVTS
SDSGSLVIDN LTSGGDNNAP VWQRVFWALL QGVVASVLLL AGGLQALQTA AIASAMPFLC
VMLLMCLGLY KALKDDWLKI NSVQMHTTSV QYAKTNISWE ERIEVLVSHP TEDEARIFLN
NVATPALSKV CQQFMTKGLT ADLEYLDGKV RLVISNEAYQ PFVYGVRMRC FEIVNPVGEE
LEQGNNWYYR AEVYLEQGGQ HYDVMGYTEE QLLADVVTQY EKYLHYLHLS NADQGHVT