Gene Mpe_A1677 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpe_A1677 
SymbolvirD4 
ID4785762 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethylibium petroleiphilum PM1 
KingdomBacteria 
Replicon accessionNC_008825 
Strand
Start bp1802893 
End bp1804122 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content69% 
IMG OID640090250 
Producttype IV secretory pathway, VirD4 component 
Protein accessionYP_001020874 
Protein GI124266870 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3505] Type IV secretory pathway, VirD4 components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.250508 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCAGCG TTGCAGCCCT TCCGTTCTCG GCCTGGCCCA CGGGCCGCAA GGTCGCGGCC 
GGCGCCTTCG CCGTCATGGG CTACATCGCA CTGGCCTGCG CGGCGGTCTA CCTGGCCGGT
GTGCTGTTCT TGGTTCTGAA CAAGGCGAAC CCCAAGCAGG CGCAGTTCGC CAGCATCGTT
CACTACTGGG GCCTCTACGC CGACGATGCC CAGCTCCGCA AGAAGCTGCA ACTCGCGATC
GGCGTGTCGG GGATCGGCCT GTTGATCCTG CTGCCAGCCG GTCTGGTCGC CGCCGCACGT
CCGCGGCGGG CCCTGCATGG CGATGCCCGC TTCGCCAGCC CCGCCGAGGT TGATCGCGCG
GGCCTCACGG GCGGCGACGG GCAGCCGGGC ATCCTCATCG GCCGCCACCG GGGCAAGTTC
CTGTCGCTTC CTGGCCAGCT CTCAGTGATG CTGTCGGCGC CGACCCGCAG CGGCAAGGGC
GTGGGCGTCG TGATCCCGAA CCTGCTCAAC TGGCCCGACT CCGTCGTCGT GCTTGACATC
AAGGGCGAGA ACTACGACAT CACGGCCGGC TACCGCGCCG CACACGGGCA GGCGGTCTAC
GCCTTCTCGC CGTTTGACGA GGACGCGCGC AGCCAGCGTG ACGCCGACGA GTACAGCGCC
ATGCTCGGCC ACTTCACCGA GCGCGCCACC TCGCGCGGGC ACAGCCGATC CTTCAGCGGC
CACGGGCACA GCACCGTCAG CCGCAACGAG AGCGAGCAGC GCCGCGCGCT GCTGCTGCCG
CAGGAGTTCA AGGAGCTGGG CAGCGAGCGC CTGGTGGTGA TATTCGAGAA CTGCAAGCCG
ATCCTCGGCG AGAAGATCCG CTACTACCGC GACAAGGCCT TCACCTCGCG CCTGCGGCCG
GCACCGGCGG TGCCGCGCAT GAACATGGAC CTGCACCTGG CCCGCGTGCA GGAGCGCTGG
CGCTACGTGG ACGACGAGCT GGGCCCGGGC GATGGCTTGG ACTACGAGCA ACTGGCCTAC
GACATGAGCC GGCTGCCCGC ACTCGCCGAT GGCGAGCCCG GCCAGGTCGC CGAAGGCATC
CTCGACTTCA TGGTCGGCCC TCGGCCGGGC GGAGCCTCGA ACGGCGGCGC GATCGAAGCC
GTCGCCGACG AAGACGACGT CCTGCTCAGC GAAGACAGCA CCGTGGTCAT CGCCGATCCG
TCCGTCATCG AACGCGCCGA CATCACCTAG
 
Protein sequence
MSSVAALPFS AWPTGRKVAA GAFAVMGYIA LACAAVYLAG VLFLVLNKAN PKQAQFASIV 
HYWGLYADDA QLRKKLQLAI GVSGIGLLIL LPAGLVAAAR PRRALHGDAR FASPAEVDRA
GLTGGDGQPG ILIGRHRGKF LSLPGQLSVM LSAPTRSGKG VGVVIPNLLN WPDSVVVLDI
KGENYDITAG YRAAHGQAVY AFSPFDEDAR SQRDADEYSA MLGHFTERAT SRGHSRSFSG
HGHSTVSRNE SEQRRALLLP QEFKELGSER LVVIFENCKP ILGEKIRYYR DKAFTSRLRP
APAVPRMNMD LHLARVQERW RYVDDELGPG DGLDYEQLAY DMSRLPALAD GEPGQVAEGI
LDFMVGPRPG GASNGGAIEA VADEDDVLLS EDSTVVIADP SVIERADIT