Gene Dtpsy_0844 details

Gene Information

Locus tag: Dtpsy_0844
Symbol:
ID: 7383539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Acidovorax ebreus TPSY
Kingdom: Bacteria
Replicon accession: NC_011992
Strand:
Start bp: 872141
End bp: 873331
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 64%
IMG OID: 643654156
Product: Zonular occludens toxin
Protein accession: YP_002552322
Protein GI: 222110058
COG category:
COG ID:
TIGRFAM ID:
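
The coordinates and lengths listed above can be cross-checked with simple arithmetic. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length.

    Assumes the CDS is a whole number of codons and, by default,
    that the final codon is a stop codon that is not translated.
    """
    codons, remainder = divmod(gene_length, 3)
    if remainder != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(872141, 873331)
print(length)                     # 1191
print(protein_length_aa(length))  # 396
```

Under these assumptions, both values agree with the Gene Length and Protein Length fields of the record.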


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGGCCCT TGCGCTCTCT GCGCCGCCAG CGCGGCTTTT TGTACCTTAC TACTGGCGGC 
AACGGCACCG GCAAGACGCT GTTTACGCTC TATGACGTGC GCAAGCTCCA GCTGGAGACT
GGTCGCCCGG TGTACTTTTC TGGCTTTGAG GCCAAGCAGC CGCTGCTGGA TTTTGGGTGG
CAACCCTTTG AGCCGGAGTG CTGGCAGGAT CTGCCTGACG GGTCGATCTG CTTGGTGGAC
GAGTGTCAGA AAGTCATGCC TGTGCGCGGC ACCGGCAAGC CGCCGGAGTG GATTGCGGCT
ATCGCCGAGG TGCATCGCAA ACGCGGCTTT GACTTCTTCC TGATCACGCA ACACCCGCTT
AATTTCGATT CGTTCGTCCG TCGTCTGGTG GCCGCGCCTG GTTGGCACCG GCACTTCAAG
GCGAGCTCCA TGGGTGACAG CTCCAATGAG CTGAAATGGT CCTCGGTTAA GGACAACCCG
CAGGTGGCCA ATAGCTCGGC CATGGGCGAG GTGACATCCC GCGCGTTCCC TCGGGAGGTG
TATGACTGGT ACGCCTCGTC CAGCCTGCAC ACGGCGCGCA AGCGGATCCC GCTGAAGGTC
TGGGGCGCCA TCGCGGGCGT CATTGCGGCG TTCGGCATGG TCGGTTTCGC CGTGTGGCAC
TTCCTCGGCT ACACCGGGGC GCCAGCGGCT GCTAAGCCGG CAGCTGCCCC GGAGGCGTCC
GCGCTGTCAA AGATGCTGAC CCCTGCTGCA GCGTCTGGTG TCGGTAGCTC GGAGCGGCAG
CCGCTGACGG TGGCTGAGTA CGTCGAGCAG CGCAAGCCGC GCCTGCCTGG GTTCCCCAAC
ACGGCCCCGG TGTATGACCA GGTCACGCAA CCCGTCGAGG CTCCGTACCC TGCAGCCTGT
GTCAAGATGG GCCAGCGTTG CGACTGCTAT ACCCAGCAAG CGACGCTGCT GCAGGTGGCT
CACGACGTGT GCATGCAGAT CGTCCAGCGC GGCTATTTCA TGGATTGGAA ACGTCCTACG
ACGGAGGCAG TCCGACAGCC GCGGCGCGAC GAACCAGTGC GCCAGGCTGC GCCAGTGCAG
TCGCCCGTGG TCATCAACAT GCCTGCCCAG GCCCAGCAGG TCCAGCCAGT GTCTGAATGG
TCACAAGGGC TCGCGGCGCG GAATGCTGAA GTGCGCTCCA TGGTGCGCTA G
 
Protein sequence
MRPLRSLRRQ RGFLYLTTGG NGTGKTLFTL YDVRKLQLET GRPVYFSGFE AKQPLLDFGW 
QPFEPECWQD LPDGSICLVD ECQKVMPVRG TGKPPEWIAA IAEVHRKRGF DFFLITQHPL
NFDSFVRRLV AAPGWHRHFK ASSMGDSSNE LKWSSVKDNP QVANSSAMGE VTSRAFPREV
YDWYASSSLH TARKRIPLKV WGAIAGVIAA FGMVGFAVWH FLGYTGAPAA AKPAAAPEAS
ALSKMLTPAA ASGVGSSERQ PLTVAEYVEQ RKPRLPGFPN TAPVYDQVTQ PVEAPYPAAC
VKMGQRCDCY TQQATLLQVA HDVCMQIVQR GYFMDWKRPT TEAVRQPRRD EPVRQAAPVQ
SPVVINMPAQ AQQVQPVSEW SQGLAARNAE VRSMVR
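
The protein sequence should follow from the gene sequence under translation table 11, whose codon-to-amino-acid assignments match the standard genetic code (the tables differ in their permitted start codons). The sketch below is a minimal illustration rather than the database's own method: `clean`, `translate`, and `gc_fraction` are hypothetical helper names, and the code assumes the sequence is given in the coding orientation and contains only A, C, G, and T.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group the sequence in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence, as formatted in the record.
print(translate("ATGAGGCCCT TGCGCTCTCT GCGCCGCCAG"))  # MRPLRSLRRQ
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the reported GC content.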