Gene Dtpsy_3539 details


Gene Information

Locus tag: Dtpsy_3539
Symbol:
ID: 7384665
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 3789216
End bp: 3790385
Gene Length: 1170 bp
Protein Length: 389 aa
Translation table: 11
GC content: 62%
IMG OID: 643656856
Product: protein of unknown function DUF1016
Protein accession: YP_002554962
Protein GI: 222112698
COG category: [S] Function unknown
COG ID: [COG4804] Uncharacterized conserved protein
TIGRFAM ID:
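
The start, end, and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp, end_bp):
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(3789216, 3790385)
print(length_bp)                  # 1170
print(protein_length(length_bp))  # 389
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.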


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAAGA AGACGCCTGC GGCGGCGGGA GCCGCACCCG CCGCTTTGCC TGCCGGCTAC 
GCCGGCATCC ACAGCGGCAT CGTGGAGCTG CTTGGCGCTG CGCGCCAGGC GGCGGCACGC
AGCGTCAATG CGCTGATGAC GGCGAGCTAT TGGGAGATTG GCCGCCGCAT AGTGGAGGCC
GAGCAACAGG GCAAACGACG TGCGGGTTAC GGTGAGCAGT TGATGGAGCG ACTGTCCACT
GATTTGACCG CGCAGTTTGG GCGGGGCTTT GGCGTGAACA ACCTGGAGAA CATGCGGCGG
TTCTTCCTCG CATACCCTGT CTCCGAGATT TCCCAGACAC TGTCTGGGAA ATTGGACAAC
GAGCTGCCCG ACGAGAAATC CCAGACAGTG TCTGGGAAAT TGAGCCTCAC CGAGCTGGCA
CAGGTGTTCA CGCTGCCGTG GTCGGCCTAT GTGCGGCTGC TGGTGGTCAA GGACAACCAT
GCCCGGCGCT TCTACGAAGC CGAGGCACTG CGCGGCGGCT GGAGCGTGCG CCAGCTTGAC
CGGCAGATTG GCAGCCAGTT TTACGAGCGC ACCGCCTTGT CCAAGGATAA GGCGGCGATG
CTGGTCAAGG GAGCGGTGGC GAGGCCCGAG GATGCCGTCA CGCCCGACGA CGCGATCAAA
GATCCGTATG TGCTGGAGTT CCTGAATCTC AAGGACGAGT ATTCGGAATC CGATCTGGAG
GCCGCGTTGA TCCAGCGGCT GGAGGATTTT CTGCTGGAGC TGGGCGAAGG CTTCACCTTC
GTCGGCCGGC AGCGGCGCTT GCGCATTGAC CAGACCTGGT ATCGGGTCGA TCTGCTGTTC
TACCATCGCA AGTTGCGTTG CTTGGTCATC ATCGACTTGA AGCTGGGCAG CCTGACCCAT
GCGGACGTGG GCCAGATGCA CATGTATTGC AACTACGCCA AGGAGCATTG GGCCTATCCC
GATGAGAACC CGCCCGTGGG GTTGATTCTC TGTGCTGACA AGGGCCATGC GCTGGCGCGG
TATGCCTTGG AAGGTTTGCC GACGAAGGTG ATGGCGGCGA ACTACCGTAC CGTGTTGCCA
GATGCCGAGC TGTTGCAGAA AGAGCTGGAA ACCACGCGGC GCTTGCTGGA GTCGCGTGTG
GTGAGGCAGC CCAAGAAGCT CCGGCAATAA
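
The gene sequence is displayed in blocks of 10 bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch, assuming GC content means the fraction of G and C bases over the whole sequence; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first displayed line is used as sample input.

```python
def clean_sequence(text):
    """Remove spaces and line breaks from a blocked sequence, uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Fraction of bases that are G or C (assumed definition of GC content)."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First displayed line of the gene sequence; paste the full block
# to work with the complete gene.
first_block = "ATGATCAAGA AGACGCCTGC GGCGGCGGGA GCCGCACCCG CCGCTTTGCC TGCCGGCTAC"
seq = clean_sequence(first_block)
print(len(seq))  # 60
print(f"{gc_fraction(seq):.0%}")
```

Running the same two functions on the complete sequence block gives values that can be compared with the Gene Length and GC content fields.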
 
Protein sequence
MIKKTPAAAG AAPAALPAGY AGIHSGIVEL LGAARQAAAR SVNALMTASY WEIGRRIVEA 
EQQGKRRAGY GEQLMERLST DLTAQFGRGF GVNNLENMRR FFLAYPVSEI SQTLSGKLDN
ELPDEKSQTV SGKLSLTELA QVFTLPWSAY VRLLVVKDNH ARRFYEAEAL RGGWSVRQLD
RQIGSQFYER TALSKDKAAM LVKGAVARPE DAVTPDDAIK DPYVLEFLNL KDEYSESDLE
AALIQRLEDF LLELGEGFTF VGRQRRLRID QTWYRVDLLF YHRKLRCLVI IDLKLGSLTH
ADVGQMHMYC NYAKEHWAYP DENPPVGLIL CADKGHALAR YALEGLPTKV MAANYRTVLP
DAELLQKELE TTRRLLESRV VRQPKKLRQ
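
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch using a hand-built codon table rather than any particular library. It relies on the fact that NCBI translation table 11 uses the same amino-acid assignments as the standard code (the two tables differ in which start codons they permit), and it does not apply start-codon handling, so it assumes the input is in frame. The names `CODON_TABLE` and `translate` are hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna):
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last displayed lines of the gene sequence (both start in frame).
first_block = "ATGATCAAGA AGACGCCTGC GGCGGCGGGA GCCGCACCCG CCGCTTTGCC TGCCGGCTAC"
last_block = "GTGAGGCAGC CCAAGAAGCT CCGGCAATAA"
print(translate(first_block))  # MIKKTPAAAGAAPAALPAGY
print(translate(last_block))   # VRQPKKLRQ
```

The two outputs match the first 20 and last 9 residues of the protein sequence shown above.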