Gene Dtpsy_1362 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDtpsy_1362 
Symbol 
ID7382614 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAcidovorax ebreus TPSY 
KingdomBacteria 
Replicon accessionNC_011992 
Strand
Start bp1423663 
End bp1424703 
Gene Length1041 bp 
Protein Length346 aa 
Translation table11 
GC content75% 
IMG OID643654680 
ProductSel1 domain protein repeat-containing protein 
Protein accessionYP_002552826 
Protein GI222110562 
COG category[R] General function prediction only 
COG ID[COG0790] FOG: TPR repeat, SEL1 subfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.00293733 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACCACGA ACCCTGGCGA CGGCCTGTCC CTCGCCCCGC TCGACCCCGC GCCCACGCAA 
ACCGCGCCTG CTGCGGCCTC ATCGCACGTC GCACGCACCT GCCCACGCTG CCACTACACG
CGCCAGGCCA GCGACACCGC CCCCGCCTGG CAATGCCCGC GCTGCGCGGT GGTCTACGAC
AAGGCCACGC CGCGCCCCCG CGCACGCGCG GGCCACGATG AAGAGGACGA TACCCAGCCC
GTCCCGCTGG ACCCGCGCCG CGCCCGGGCC AGCGCGGGCC TGCCCTGGGC CTGGATCGCC
CTGGGCAGCG CCGTGCTGGC CGGGGCCGCG CTGCTGGGCT GGAAGTGGAA CGGCGAGCGC
CAGCGCAGCG CCCAACAGGT GCAACGCGCC GCCAGCGACA GCCGCGCCGC CGATGTCAGC
CAGGCCCGCG CCGTGCAGGA CGGGCGGGCG CGCATCGATG CACTGGAACA CCAGTGGCGC
ATGGGCGAAG GCGCCCAGGC GCTGCCTGCC GTGCGCGCGC TGGCCGACGA GGGCGAGCCG
CGGGCCATGG TGCTGCTGGG CTCCATGCTG CTGGGTGGCA GCAGCTACCG CAACGCGATC
GGCCAGCCGC TGGACCCGGC CGAGGCGCAG CAATGGCTGG AGCGCGCCGC CCGCGCGGGC
GATGCCACTG CCGCCGTGCG CCTGGGGGGC CTGTATGAGC GCGGTGAACA TGTGCCGCGC
CAGCCCTCAC TGGCCGAGAA CTGGTACCTG CGTGCCGCGC GCCAGGGCGA CGGCGCGGGC
CTGTACAGCC TGGGCATGCT CTACGCGCGG GGCGCCGATC CCGTAAGCCA ACGCCCCGTC
CCCGCGTGGA TGCTGCTCAC GCTGGCCGAA CGCGCCTCGC GCGCCGCGCC GGAGCGCGAC
GCGCTGCTGA CCGAGCAGCA CTACCCGTCC AGCGCCCGTG CAGGCCCGGT CCGCCTCAAG
GACAAGCTGC ATCCCAGCGA CATTGCCGAG GCCGAGCGCC TGGCCGACGC CTGGAAGCCC
GGCCAGCCGC TGGGCTTCTA G
 
Protein sequence
MTTNPGDGLS LAPLDPAPTQ TAPAAASSHV ARTCPRCHYT RQASDTAPAW QCPRCAVVYD 
KATPRPRARA GHDEEDDTQP VPLDPRRARA SAGLPWAWIA LGSAVLAGAA LLGWKWNGER
QRSAQQVQRA ASDSRAADVS QARAVQDGRA RIDALEHQWR MGEGAQALPA VRALADEGEP
RAMVLLGSML LGGSSYRNAI GQPLDPAEAQ QWLERAARAG DATAAVRLGG LYERGEHVPR
QPSLAENWYL RAARQGDGAG LYSLGMLYAR GADPVSQRPV PAWMLLTLAE RASRAAPERD
ALLTEQHYPS SARAGPVRLK DKLHPSDIAE AERLADAWKP GQPLGF