Gene Spro_1805 details


Gene Information

Locus tag: Spro_1805
Symbol:
ID: 5605212
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Serratia proteamaculans 568
Kingdom: Bacteria
Replicon accession: NC_009832
Strand:
Start bp: 1988172
End bp: 1990166
Gene Length: 1995 bp
Protein Length: 664 aa
Translation table: 11
GC content: 55%
IMG OID: 640937337
Product: choline/carnitine/betaine transporter
Protein accession: YP_001478036
Protein GI: 157370047
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1292] Choline-glycine betaine transporter
TIGRFAM ID: [TIGR00842] choline/carnitine/betaine transport
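
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS is unspliced and ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must be >= start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1988172, 1990166)
print(length, protein_length_aa(length))  # prints "1995 664"
```

Both results match the Gene Length and Protein Length fields reported for Spro_1805.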


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.376035
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00814736
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCCAAA ACGACACTGA AATTGTGGCA CGTCGTAGCC ATATCAATCC TCCGGTATTT 
TTTATTTCTG CCGCTTTAAT CATCGTCCTG GTGGCCTTTG CCGCCCTTTA CCCCGAGTTG
GCCGATCGCA AATTCAAAGC GTTGCAACTG GGTATTTTCA CTAACGCCAG CTGGTTTTAT
ATTCTGGCGG TGGCATTGAT CCTGATGAGC GTGACCTATC TGGGTTTGTC GCGTTACGGC
AATATCAAGC TCGGGCCGGA TCACGCCCAG CCAGACTTTA GCTATGTTTC CTGGTTCGCC
ATGCTGTTTT CCGCCGGGAT GGGCATTGGC CTGATGTTCT TTGGCGTAGC CGAACCCGTA
ATGCATTACC TTTCTCCCCC AGTCGGAACG CCGGAAACAG TGGAAGCGGC TAAACAGGCC
ATGCGCCTAA CCTTCTTCCA CTGGGGCCTG CACGCCTGGG CAATCTACGC CATCGTGGCG
TTGATTCTGG CCTTCTTCAG CTATCGCCAC GACCTGCCGC TGACGCTGCG TTCCGCGCTG
TATCCGATTA TCGGCGATCG CATCTATGGG CCAATTGGCC ACGCGGTAGA TATTTTTGCG
GTGATAGGTA CCGTGTTCGG TGTCGCAACC TCGCTGGGCT ACGGTGTGTT GCAGGTTAAC
GCCGGTCTTA ACCACCTGTT TGGCTTGCCG ATCAACGCAA CGGTGCAGGT GATTCTGATC
GTGGTGATCA CCGGGTTGGC TACGCTGTCA GTGGTTTCCG GGCTGGATAA AGGCATCCGC
ATTCTGTCCG AATTGAACCT CGGCCTGGCG GTGTTGCTGC TGGTGCTGGT GGCTGCGCTC
GGCCCGACCG TGCTGCTGTT AAAATCCTTT GTCGAAAATA CCGGCGGTTA CCTTTCTGAC
ATCGTCAGCA AGACCTTTAA CCTGTATGCC TATGAGCCGA AGTCCAGTAA CTGGCTGGGC
GGTTGGACAC TGCTGTATTG GGGCTGGTGG TTGTCATGGT CGCCTTTCGT CGGCATGTTT
ATTGCCCGCG TTTCACGCGG ACGTACCATC CGTGAATTCG TTACCGGCGT GCTGTTCGTC
CCTGCCGGCT TTACGCTACT GTGGATGACC GTGTTCGGTA ATAGTGCCAT TAATCTGATC
ATGGCCGAAG GTGCGCGCGA TTTGGCTAAC ACGGTGCAAA ATGACGTAGC ACTGGCACTG
TTCAACTTCC TGGAACACTT CCCGTTTTCT AATATCTTGT CGTTTATCGC GATGGCCATG
GTGGTAGTGT TCTTCGTCAC TTCCGCGGAT TCGGGGGCGA TGGTGGTAGA TACCCTGGCT
TCCGGGGGTA CCGACCAAAC TCCGGTCTGG CAGCGGATTT TCTGGGCCGG CATGATGGGC
CTGGTCGCCA TTGCCTTGCT GCTCGCCGGT GGGCTGAGCG CGCTGCAAAC CGTGACCATC
GCCAGTGCCC TGCCGTTCTC GATGATCTTG CTGGTGTCGA TCTACGGGCT GTTAAAAGCC
TTGCGTATCG ATGCCCACAA ACGCGACAGC CAGACGACGA CCACGATTGC ACCGACTGCC
GCACGCAATC CGATCTCGTG GCAACGGCGT TTACGCAATA TTGCCTATTT CCCTAAACGC
TCACAGGTCA AACGCTTTGT CGGTGAAGTG GTTCAGCCGG CGATGGCGCT GGTGGAAGCC
GAATTGGCCA AACAGAACAC CACCTCCTCG ATTGACGATT CACAGGACGA TCGCATCCGT
TTCGAAGTTG ATTTGGGTGA AGACCTGAAC TTTGTCTATG AGGTTCGCCT GCGGGCCTAC
ATTCAACCGG CCTTTGCGTT GGCGGGCTTG AAAGACGAAG AACGTGATGA GGAACATAAG
TACTATCGGG GTGAAGTCCA CCTGAAGGAA GGGGGCCAGG ATTACGATGT GATGGGCTGG
ACGCAAGAAC AGATCATTCA CGATATTCTC GACCAGTACG AGAAACACCT GCACTTCCTG
CACCTGGTAC GCTAG
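
The gene sequence is printed in space-separated blocks of ten bases, so it has to be flattened before any per-base statistic such as GC content can be recomputed. A minimal sketch follows, assuming the sequence contains only whitespace and the letters A, C, G, T. The helper names are hypothetical, and the record does not state how its own GC figure was rounded.

```python
def clean_sequence(block: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(block.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence above, used only as sample input.
first_line = "ATGAGCCAAA ACGACACTGA AATTGTGGCA CGTCGTAGCC ATATCAATCC TCCGGTATTT"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.0%}")
```

Running the same two functions over the full sequence gives a value that can be compared with the GC content field in the Gene Information section.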
 
Protein sequence
MSQNDTEIVA RRSHINPPVF FISAALIIVL VAFAALYPEL ADRKFKALQL GIFTNASWFY 
ILAVALILMS VTYLGLSRYG NIKLGPDHAQ PDFSYVSWFA MLFSAGMGIG LMFFGVAEPV
MHYLSPPVGT PETVEAAKQA MRLTFFHWGL HAWAIYAIVA LILAFFSYRH DLPLTLRSAL
YPIIGDRIYG PIGHAVDIFA VIGTVFGVAT SLGYGVLQVN AGLNHLFGLP INATVQVILI
VVITGLATLS VVSGLDKGIR ILSELNLGLA VLLLVLVAAL GPTVLLLKSF VENTGGYLSD
IVSKTFNLYA YEPKSSNWLG GWTLLYWGWW LSWSPFVGMF IARVSRGRTI REFVTGVLFV
PAGFTLLWMT VFGNSAINLI MAEGARDLAN TVQNDVALAL FNFLEHFPFS NILSFIAMAM
VVVFFVTSAD SGAMVVDTLA SGGTDQTPVW QRIFWAGMMG LVAIALLLAG GLSALQTVTI
ASALPFSMIL LVSIYGLLKA LRIDAHKRDS QTTTTIAPTA ARNPISWQRR LRNIAYFPKR
SQVKRFVGEV VQPAMALVEA ELAKQNTTSS IDDSQDDRIR FEVDLGEDLN FVYEVRLRAY
IQPAFALAGL KDEERDEEHK YYRGEVHLKE GGQDYDVMGW TQEQIIHDIL DQYEKHLHFL
HLVR