Gene Smed_6143 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSmed_6143 
Symbol 
ID5320445 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameSinorhizobium medicae WSM419 
KingdomBacteria 
Replicon accessionNC_009621 
Strand
Start bp1070183 
End bp1071829 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content53% 
IMG OID640777774 
Producttype IV secretion system protein VirD4 
Protein accessionYP_001314706 
Protein GI150378111 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones28 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGTTCAC TGGCAGTAGG TTATTGTGGG GCAAGCGCCT ACTCAACGTT CCGTTTTGGT 
TTTGATGGAA GGGCTTTGAT GACATTTGAC ATCCTCGCTT TTTGGTATGA GACGCCTTTC
TATCTGGGAT ACACAACTCT CTTCTTCTAT AGGGGTTTGG CTGTCGTCGT CTTAACGTCG
GCAGCCATTC TGCTCGTTCA GCAGATGGTA TCCGTGCGCG ATCGGCAACA TCACGGTACT
GCACGCTGGG CTCGCGTGGA TGAAATGCGG CGCGCGGGTT ATCTTCAGCG GTACAGCCGC
ATCAGTGGAC CGGTGTTCGG AAAGACGAGC GGGCCTTTTT GGTCTGACTA CTACCTGACC
AATAGCGAGC AGCCTCACAG CCTCATCGTT GCGCCGACGC GCGCGGGAAA AGGCGTCGGC
ATTGTAATTC CAACGCTACT AACGTTCGAG GGCTCAGTGA TAGCCCTCGA CGTTAAGGGC
GAGCTCTTTG ATCTCACTTC TAGGGCTCGC AAAGCGCGGG GCGATAGCGT GTTCAAATTG
GCACCGCTAG ACCCCGAGCG GCGGACGAAT TGCTATAATC CTCTATTGGA TATCCTAGCA
TTGCCGTCAG AGCGGCAGTT CACCGAAGCG CGTCGTTTGG CGGCAAACCT CATTGCGACC
AAGGGACAGA GTGCGGAAGG TTTCATCAAC GGCGCACGAG ATCTTTTTGT CGCCGGCATT
CTTGCCTGCA TCGAGCGCGG TACGCCAACC ATCGGGGCCG TCTACGACCT CTTTGCTCAG
CCAGGGGAGA AATATAGCCT TTTTGCGCGT CTTGCGCAGG AGACCCAGAA TAAGGAGGCT
CAGCGGATCT TCGATGAAAT GGCGAGTAAT GATACAAAGA TTCTGACCTC CTACACTTCT
GTGCTCGGTG ATGGCGGGCT GAACTTATGG GCCGACCCGC TGATTAAAGC GGCGACAAGC
CGATCGGACT TCTCAATCTA CGACCTGCGA CGTCGCAGGA CATGCATTTA TCTTTGCGTC
AGCCCAAACG ATCTTGAGGT GGTCGCGCCT CTGATGCGCC TGCTCTTCCA GCAGGTCGTT
TCAATTCTGC AACGCTCGCT GCCCGGCAAG GACGAGAAGC ACGAAGTGCT ATTCCTGCTC
GATGAATTCA AGCATCTTGG TAAGCTCGAG GCGATTGAGA CTGCAATCAC CACGATCGCG
GGCTACAAGG GCCGTTTTAT GTTCATTATC CAAAGTCTCT CGGCACTGAC CGGAACATAC
GACGAATCTG GAAAGCAGAA TTTCCTCAGC AATACTGGTG TGCAGGTCTT TATGGCAACA
GCTGACGATG AGACACCGGT TTATATTTCG AAAGCCATCG GTGAATATAC ATTCCAAGCG
CGCTCAACTT CCTATACCCA AAGCCTTACG TTCGATCGCA ATATTCAACA CTCAGATCAA
GGAGCACCTT TATTAAGGCC AGAACAGGTG CGTCTACTAC CCGACAAGTA CCAGATCGTT
CTCATTAAGG GTCAGCCACC ATTGCAACTA CGAAAGGTAC GATATTATTC CGATCGCGCA
CTGAAGCGCA TCTTTGATAG CCAGACGGGC AGGCTTCCGG AGCCAGCACC CCTGATGATT
GCAGACGAGA GATTTAGCCA CGTCTAG
 
Protein sequence
MCSLAVGYCG ASAYSTFRFG FDGRALMTFD ILAFWYETPF YLGYTTLFFY RGLAVVVLTS 
AAILLVQQMV SVRDRQHHGT ARWARVDEMR RAGYLQRYSR ISGPVFGKTS GPFWSDYYLT
NSEQPHSLIV APTRAGKGVG IVIPTLLTFE GSVIALDVKG ELFDLTSRAR KARGDSVFKL
APLDPERRTN CYNPLLDILA LPSERQFTEA RRLAANLIAT KGQSAEGFIN GARDLFVAGI
LACIERGTPT IGAVYDLFAQ PGEKYSLFAR LAQETQNKEA QRIFDEMASN DTKILTSYTS
VLGDGGLNLW ADPLIKAATS RSDFSIYDLR RRRTCIYLCV SPNDLEVVAP LMRLLFQQVV
SILQRSLPGK DEKHEVLFLL DEFKHLGKLE AIETAITTIA GYKGRFMFII QSLSALTGTY
DESGKQNFLS NTGVQVFMAT ADDETPVYIS KAIGEYTFQA RSTSYTQSLT FDRNIQHSDQ
GAPLLRPEQV RLLPDKYQIV LIKGQPPLQL RKVRYYSDRA LKRIFDSQTG RLPEPAPLMI
ADERFSHV