Gene Information |
Locus tag | Smed_6143 |
Symbol | |
ID | 5320445 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sinorhizobium medicae WSM419 |
Kingdom | Bacteria |
Replicon accession | NC_009621 |
Strand | + |
Start bp | 1070183 |
End bp | 1071829 |
Gene Length | 1647 bp |
Protein Length | 548 aa |
Translation table | 11 |
GC content | 53% |
IMG OID | 640777774 |
Product | type IV secretion system protein VirD4 |
Protein accession | YP_001314706 |
Protein GI | 150378111 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3505] Type IV secretory pathway, VirD4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 28 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTGTTCAC TGGCAGTAGG TTATTGTGGG GCAAGCGCCT ACTCAACGTT CCGTTTTGGT TTTGATGGAA GGGCTTTGAT GACATTTGAC ATCCTCGCTT TTTGGTATGA GACGCCTTTC TATCTGGGAT ACACAACTCT CTTCTTCTAT AGGGGTTTGG CTGTCGTCGT CTTAACGTCG GCAGCCATTC TGCTCGTTCA GCAGATGGTA TCCGTGCGCG ATCGGCAACA TCACGGTACT GCACGCTGGG CTCGCGTGGA TGAAATGCGG CGCGCGGGTT ATCTTCAGCG GTACAGCCGC ATCAGTGGAC CGGTGTTCGG AAAGACGAGC GGGCCTTTTT GGTCTGACTA CTACCTGACC AATAGCGAGC AGCCTCACAG CCTCATCGTT GCGCCGACGC GCGCGGGAAA AGGCGTCGGC ATTGTAATTC CAACGCTACT AACGTTCGAG GGCTCAGTGA TAGCCCTCGA CGTTAAGGGC GAGCTCTTTG ATCTCACTTC TAGGGCTCGC AAAGCGCGGG GCGATAGCGT GTTCAAATTG GCACCGCTAG ACCCCGAGCG GCGGACGAAT TGCTATAATC CTCTATTGGA TATCCTAGCA TTGCCGTCAG AGCGGCAGTT CACCGAAGCG CGTCGTTTGG CGGCAAACCT CATTGCGACC AAGGGACAGA GTGCGGAAGG TTTCATCAAC GGCGCACGAG ATCTTTTTGT CGCCGGCATT CTTGCCTGCA TCGAGCGCGG TACGCCAACC ATCGGGGCCG TCTACGACCT CTTTGCTCAG CCAGGGGAGA AATATAGCCT TTTTGCGCGT CTTGCGCAGG AGACCCAGAA TAAGGAGGCT CAGCGGATCT TCGATGAAAT GGCGAGTAAT GATACAAAGA TTCTGACCTC CTACACTTCT GTGCTCGGTG ATGGCGGGCT GAACTTATGG GCCGACCCGC TGATTAAAGC GGCGACAAGC CGATCGGACT TCTCAATCTA CGACCTGCGA CGTCGCAGGA CATGCATTTA TCTTTGCGTC AGCCCAAACG ATCTTGAGGT GGTCGCGCCT CTGATGCGCC TGCTCTTCCA GCAGGTCGTT TCAATTCTGC AACGCTCGCT GCCCGGCAAG GACGAGAAGC ACGAAGTGCT ATTCCTGCTC GATGAATTCA AGCATCTTGG TAAGCTCGAG GCGATTGAGA CTGCAATCAC CACGATCGCG GGCTACAAGG GCCGTTTTAT GTTCATTATC CAAAGTCTCT CGGCACTGAC CGGAACATAC GACGAATCTG GAAAGCAGAA TTTCCTCAGC AATACTGGTG TGCAGGTCTT TATGGCAACA GCTGACGATG AGACACCGGT TTATATTTCG AAAGCCATCG GTGAATATAC ATTCCAAGCG CGCTCAACTT CCTATACCCA AAGCCTTACG TTCGATCGCA ATATTCAACA CTCAGATCAA GGAGCACCTT TATTAAGGCC AGAACAGGTG CGTCTACTAC CCGACAAGTA CCAGATCGTT CTCATTAAGG GTCAGCCACC ATTGCAACTA CGAAAGGTAC GATATTATTC CGATCGCGCA CTGAAGCGCA TCTTTGATAG CCAGACGGGC AGGCTTCCGG AGCCAGCACC CCTGATGATT GCAGACGAGA GATTTAGCCA CGTCTAG
|
Protein sequence | MCSLAVGYCG ASAYSTFRFG FDGRALMTFD ILAFWYETPF YLGYTTLFFY RGLAVVVLTS AAILLVQQMV SVRDRQHHGT ARWARVDEMR RAGYLQRYSR ISGPVFGKTS GPFWSDYYLT NSEQPHSLIV APTRAGKGVG IVIPTLLTFE GSVIALDVKG ELFDLTSRAR KARGDSVFKL APLDPERRTN CYNPLLDILA LPSERQFTEA RRLAANLIAT KGQSAEGFIN GARDLFVAGI LACIERGTPT IGAVYDLFAQ PGEKYSLFAR LAQETQNKEA QRIFDEMASN DTKILTSYTS VLGDGGLNLW ADPLIKAATS RSDFSIYDLR RRRTCIYLCV SPNDLEVVAP LMRLLFQQVV SILQRSLPGK DEKHEVLFLL DEFKHLGKLE AIETAITTIA GYKGRFMFII QSLSALTGTY DESGKQNFLS NTGVQVFMAT ADDETPVYIS KAIGEYTFQA RSTSYTQSLT FDRNIQHSDQ GAPLLRPEQV RLLPDKYQIV LIKGQPPLQL RKVRYYSDRA LKRIFDSQTG RLPEPAPLMI ADERFSHV
|
| |
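The length fields in this record can be cross-checked against the coordinates. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names (`gene_length`, `protein_length`, `gc_percent`) are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a complete CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def gc_percent(seq: str) -> float:
    """Percent G+C in a sequence; whitespace between 10-base groups is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# Values copied from the record above.
start_bp, end_bp = 1070183, 1071829
length = gene_length(start_bp, end_bp)
print(length)                  # 1647
print(protein_length(length))  # 548
```

Passing the full gene sequence string to `gc_percent` gives a value that can be compared with the GC content field, which the record reports rounded to a whole percent.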