Gene Dvul_1403 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDvul_1403 
Symbol 
ID4665022 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDesulfovibrio vulgaris DP4 
KingdomBacteria 
Replicon accessionNC_008751 
Strand
Start bp1704200 
End bp1705330 
Gene Length1131 bp 
Protein Length376 aa 
Translation table11 
GC content64% 
IMG OID639819634 
Productglycine cleavage system T protein 
Protein accessionYP_966848 
Protein GI120602448 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0404] Glycine cleavage system T protein (aminomethyltransferase) 
TIGRFAM ID[TIGR00528] glycine cleavage system T protein 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCCATA CAACCCTGCA ACCCAGCGAC AAGGAGACGG AGTTGTCTGA CCTTCTGCTG 
ACCCCACTCA ATGCATGGCA TCGCGCCCAA GGTGCGAAGA TGGCCCCCTT CGCCGGTTGG
GACATGCCCA TCCAGTATGA GGGCATCCTC GCCGAACATC AGCACACCCG CACCCATGCG
GCCCTGTTCG ACATATGCCA CATGGGCGAA TTCGCCCTGC GCGGCCCCGG CGCGAAGCAG
GCCCTCGCAA GGGCCGTCAC CCACAATCTC GAGACGCTCA AGCCCGGACG CTGCCGCTAC
GGATTCCTTC TCAATGAGGC AGGCTGCGTG CTTGACGACC TCATCGTCTA CTGTCTGGCC
GAAGACGACT ACATGCTCGT CGTCAACGGG GCCTGCATAG CGTCAGACTT CGCCGCCTTG
CGCGAACGCC TTCCCGCATC GCTGCACTTC GAGGACATCT CCGCAGCCAC CGCAAAACTC
GACCTGCAGG GGCCGAAATC CATCGACGCA CTGGAGGGGC TTCTGGGGAG GAGTTTCCGC
GAACTCGGCT ACTTCGCCTT CACGCACACC ACCTTCGACG GTGCCAATCT CATGGTGAGC
CGTACCGGAT ACACCGGAGA ACTCGGTTAC GAGCTCTATC TCCCATGGGA CAAGGCAGAG
ACCCTGTGGA CGCGCCTTCT CGAGAATGCC GACGTCAAGC CTGCGGGCCT TGGCGCACGC
GACACCCTGC GACTCGAAGT CGGGCTTCCC CTCTACGGGC AGGACCTCGA CACGACCCAC
ACCCCAGCGG AAGCAGGCTA CGAAGGCATG CTGACGAACA CCGTCGACTA TGTGGGCAAA
GGGCGTGACC GTGAGGTGCG CGAGGTACTC GTCCCCCTCG CCATTCCGGG CAGGCGTGCG
GCACGGCATG GCGACGCGGT GGCCCTTCCC GACGGCACTG TCGTCGGCGT CGTCACCAGC
GGTTCGTTCG CCCCCAGTGT CGGACATGCC GTCGCCCTCG CCTACGTCAA GAGGCCCCAT
GCCGAAGAGG ACTCGTTCAT CATCAAGGCC GCAAGGGTTG AGCTCGAAGC GAAGCGGGCA
CCCCTGCCCT TCTACGCTGG CGGCACCGCC CGCATGAAGC TGCAGGGTTG A
 
Protein sequence
MTHTTLQPSD KETELSDLLL TPLNAWHRAQ GAKMAPFAGW DMPIQYEGIL AEHQHTRTHA 
ALFDICHMGE FALRGPGAKQ ALARAVTHNL ETLKPGRCRY GFLLNEAGCV LDDLIVYCLA
EDDYMLVVNG ACIASDFAAL RERLPASLHF EDISAATAKL DLQGPKSIDA LEGLLGRSFR
ELGYFAFTHT TFDGANLMVS RTGYTGELGY ELYLPWDKAE TLWTRLLENA DVKPAGLGAR
DTLRLEVGLP LYGQDLDTTH TPAEAGYEGM LTNTVDYVGK GRDREVREVL VPLAIPGRRA
ARHGDAVALP DGTVVGVVTS GSFAPSVGHA VALAYVKRPH AEEDSFIIKA ARVELEAKRA
PLPFYAGGTA RMKLQG