Gene Smed_1803 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_1803 
Symbol 
ID5322661 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009636 
Strand
Start bp1887117 
End bp1888892 
Gene Length1776 bp 
Protein Length591 aa 
Translation table11 
GC content63% 
IMG OID640790741 
ProductTrkA domain-containing protein 
Protein accessionYP_001327473 
Protein GI150397006 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0471] Di- and tricarboxylate transporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.00129836 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCGCCG AACAGTCCCT TTCGTTTATC GTGCTCGGCA TCATGATGAT CTTCTTCATG 
TGGGGGCGGT TTCGCTACGA TATCGTGGCC TGCTCGGCGC TGATGCTCGC CGTTGCCGTC
GGCATCGTTC CCTTCGATCA CGCCTTTGAC GGGTTCAGCG ACGATATCGT CATCATCGTC
GGCAGCGCGC TCATCGTCAG CGCGGGCGTG GCGCGTTCGG GGGTGGTCGA TGCCGCCATC
CAGCGCTTTC TGCCGGAGCT GCAGTCCGTG AGGGCGCAGC TTGCTCTTCT TGTGGTCACG
GTGACCATTC TTTCAGCCTT CATAAAGAAC ATAGGCGCGC TTGCGATCAT GATTCCGGTC
GCGTTCCAGT TCGCGCGTCG CTCGAAAGTG CAGCCTTCGG TATTTTTGAT GCCGATGGCG
TTCGGATCGC TGATCGGCGG GCTGATGACG CAGGTCGGCA CATCGCCGAA CGTGGTCGTG
TCGCGGGTGC GTGCGGATCT CACGGGTGAG AGCTTCACCA TGTTCGACTT CACGCCCGTC
GGTGCTTCGC TGGCTCTGGT CGGCGCGGTG TTCCTGCTGT TCGGTTACAA GCTGGTGCCG
GAGCGGAAGA GTCAGCAGGT CGGCATGGAT CAGGCGGTCG AGATCACCGA CTATACGTCC
GAGGCGGTGG TTCCTGCCGG TTCGCCTGTC ATCGGCAAGC CGCTCAGCAA TCTGGTCAAG
ATCGGCGACG GAGGAGCGGT CGTGATCGCC GTCTTCCGTC GCGGCACGCA TCTGGCGCCC
TTGCCCGACG TCGTCATCGA GCTGGACGAT ATTCTACTGC TCGAAGGTGG TCCTGCGGCG
CTCGATCGCA TCGTTTCCCA AGCGAAACTC AAGATTTCCG GCGATCGCTC GCCGATGCCG
AACGACAAGG CGAAAGCCGA TACCGAAGCA ATCGAGGCGG TGATCGCCAC CGGTTCCCCC
CTTATCGGCA TGTCGGCGCA GCGCCTGGCG CTCTTCAACA ATCACAACAT CAACCTCTTG
GCCGTCAGTC GCCAGGGCGA ACGGTTGAAA CAGAGGCTTG GCAGCATCCG GCTGCGCGCC
GGCGATATCG TCGTGCTGCA GGGCACCCGC AGGGAGTTGC CCTCGTTTCT TCAGGACTTC
GGCTGCCTGC CGCTCGCGCA ACGGGAAATC CTGCTGGGCA CCATCCGCCG TGCCACCGTG
CCCCTGCTTG TGCTGGCGAC CGCGATGGGC GCGACCGCGC TGGGCATCGT ACCTGTGCAG
ATCGCATTTT TTGCTGCCGC GCTTGCCATG GTGGTGTTTC GCGTTATTCC GCTACGCGAC
GTCTACCGTG CGGTGGACGG ACCGATCCTG GTCATGCTTG CCGCGCTCAT CCCGGTCAGC
GAGACGTTAC GCACGACAGG CGGTTCGGAT CTGATCGCGG GGTGGCTGAG CGGGATGGCT
GCTGATCTCC CTCCTGCCGG GGCGCTTGCC ATCATGGTCG TGGCTGCCAT GGCGGTCACC
CCCTTCCTCA ATAATGCCGC TACGGTTCTC GTCATGGCCC CCATAGCGGC CAGTTTTGCG
ACGGCGCTCG ATTACAGGCC GGACGCCTTC CTGATGGCAG TTGCGATCGG CGCCGGCTCG
GACTTCCTCA CCCCGATCGG CCACCAGTGC AATACGCTGG TGATGGGACC CGGCGGCTAC
CGCTTCAGCG ACTACCCGCG ACTGGGGCTA CCGCTCTCTA TCGTCATCGT GATCGTCGCG
GTGCCGATGC TGATGTGGAT TTGGCCCCTG CGCTAA
 
Protein sequence
MTAEQSLSFI VLGIMMIFFM WGRFRYDIVA CSALMLAVAV GIVPFDHAFD GFSDDIVIIV 
GSALIVSAGV ARSGVVDAAI QRFLPELQSV RAQLALLVVT VTILSAFIKN IGALAIMIPV
AFQFARRSKV QPSVFLMPMA FGSLIGGLMT QVGTSPNVVV SRVRADLTGE SFTMFDFTPV
GASLALVGAV FLLFGYKLVP ERKSQQVGMD QAVEITDYTS EAVVPAGSPV IGKPLSNLVK
IGDGGAVVIA VFRRGTHLAP LPDVVIELDD ILLLEGGPAA LDRIVSQAKL KISGDRSPMP
NDKAKADTEA IEAVIATGSP LIGMSAQRLA LFNNHNINLL AVSRQGERLK QRLGSIRLRA
GDIVVLQGTR RELPSFLQDF GCLPLAQREI LLGTIRRATV PLLVLATAMG ATALGIVPVQ
IAFFAAALAM VVFRVIPLRD VYRAVDGPIL VMLAALIPVS ETLRTTGGSD LIAGWLSGMA
ADLPPAGALA IMVVAAMAVT PFLNNAATVL VMAPIAASFA TALDYRPDAF LMAVAIGAGS
DFLTPIGHQC NTLVMGPGGY RFSDYPRLGL PLSIVIVIVA VPMLMWIWPL R