Gene Dvul_1804 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1804 
Symbol 
ID4662578 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp2109717 
End bp2110997 
Gene Length1281 bp 
Protein Length426 aa 
Translation table11 
GC content59% 
IMG OID639820045 
Productaromatic hydrocarbon degradation membrane protein 
Protein accessionYP_967248 
Protein GI120602848 
COG category[I] Lipid transport and metabolism 
COG ID[COG2067] Long-chain fatty acid transport protein 
TIGRFAM ID[TIGR03304] outer membrane insertion C-terminal signal 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.388514 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.581648 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAGCGAC TTTGTACACT TCTTCTTGCC GCTGGCCTTC TGCTTGCGGC CTTTACGGGT 
TCGGCGTCCG CCGAGGGCTT TGCACTCTAT GAGTGGGGCG CGCGTGGCAA CGCGCTGGGT
GGTTCCATGG TCGGTCGTGC TGACGACCCT TCGGCCGTGG CGTTCAACCC GGCGGGCATC
ACACAGCTTG AGGGTACCCA TGTGATGGGC GGTTTCAGCG CCATCATCCC CAGCAGCACA
GTCGAAATCA ATCATGGCGG TAGAAGCTAC GAAGGCGAGG GCGCCTTCAA CATATGGGTT
CCTCCGCACG GTTACATGAC CACGCAACTC AGCGACAACA TGTGGCTGGG TATGGGCATA
TACACCCGTT TCGGCCTTGG TTCGGAGTAT GACGACTCAA GCTGGGGCGG GCGCTACAAT
ATCTATAATG TCGGCATCCA GACCGTGTCG TTCAACCCCA ACCTCGCCTT CAAGCTCACC
GACGACCTGT CGGCAGCCAT CGGTGTCGAG GTGATGGGGC TGAAGCTGGA CATGAAGAAG
AAAATCAATC CTACCGGCAA GGCCAACAAC GCTCCCGGGA ACAATCCTGC CGGAGACATC
GATTCAGACC TTGAGGCCGA CAGCTACGGA GTTGGCCTTA CGGCGGGCCT GCATTATCGC
CTCAACGACC AGTGGGCCGC GGGCGTGAGC TACAAGAGTC AGGTGAAGCA CAAGGCGCGC
GGCACCAATG ACTTCAGCAA TGTCCCTGCT GGACCGTTGC AGGGTGTCTA TGCCGATTGC
GATGTGAATG GTGTGGTCAT CCTTCCTGAC ATGATTTCCT TCGGCGTCGT CTACTATCCC
ACTCCGGAGT GGAGCATCGA AGCCGGTGCC ACACTCACGC GCTGGAGTCT CTATGACAAC
CTTCCCATCT ACCATTCTGC CCCTTTTGCT GGTTCCGGCA TGATGGTGAA CAGCGAGAAG
GAGTGGAACG ACGCTTGGCG CTACAACGTG GGCGTGGAAT ACAAGGCCAC CGACTGGATG
GACCTGCGTG TCGGCTACGT CTATGACGTG TCGCCCGTCG ATGACGACCA TGTGGACTAC
CTCATCCCCA CGCAGGACAG GCAACTGTAC AGCACCGGTG TGGGCTTCCA CTGGGATAGC
TACACCGTCG ACCTCTCGTA CACCTACATC GTCGCCAGCG AAGTCGAGTA CTCGCAACGC
ACATCGGAGG GCATCTTCGA GGGCAATTCC AAGGGTGGCC GGACGCATGT ACTCGGCCTC
TCGGTGGGAT ACACCTTCTA G
 
Protein sequence
MKRLCTLLLA AGLLLAAFTG SASAEGFALY EWGARGNALG GSMVGRADDP SAVAFNPAGI 
TQLEGTHVMG GFSAIIPSST VEINHGGRSY EGEGAFNIWV PPHGYMTTQL SDNMWLGMGI
YTRFGLGSEY DDSSWGGRYN IYNVGIQTVS FNPNLAFKLT DDLSAAIGVE VMGLKLDMKK
KINPTGKANN APGNNPAGDI DSDLEADSYG VGLTAGLHYR LNDQWAAGVS YKSQVKHKAR
GTNDFSNVPA GPLQGVYADC DVNGVVILPD MISFGVVYYP TPEWSIEAGA TLTRWSLYDN
LPIYHSAPFA GSGMMVNSEK EWNDAWRYNV GVEYKATDWM DLRVGYVYDV SPVDDDHVDY
LIPTQDRQLY STGVGFHWDS YTVDLSYTYI VASEVEYSQR TSEGIFEGNS KGGRTHVLGL
SVGYTF