Gene Tpau_1101 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpau_1101 
Symbol 
ID9155241 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTsukamurella paurometabola DSM 20162 
KingdomBacteria 
Replicon accessionNC_014158 
Strand
Start bp1127416 
End bp1128975 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content70% 
IMG OID 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_003646072 
Protein GI296138829 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGACGAC TGGTCATCAT CCTCATCGGA ACAGCACTGT TGGCCCTCGC CGGCGGACCG 
CCGGCGGGGG CGATCATCGG CGGGAAGGTC GCCCCGGAGG CGCCGCCGTC CCTTGGCTCT
CTGCAGGTCG AGCTGCCCGG CGATGCGGTG ACCCCCGATA ACCACCGCTG CGGCACGACG
CTGATCGCAC CGCAGTGGGT GGTCACCGCG AGCCACTGCG CGATCCTCGG CAACGCCGGC
GATGCGGCCA CCGGCCCGAT GCGCGCCTCC GTGCGGATCG GGTCGACGAA CACGGGCACG
GGCGGTGAAC TGGTGGGTGT CGATCGCTTC TATCGCCTGG GCACCCCGAC CGAAGCGCGC
GAAGACGGAT ACGACTGGCT GCTCCGCGAT ATCGCGCTGC TACGGCTCGA ACGACCGGTG
CGTGCCACCC CGATCCCCAT CGCATCGAAG ACCCCCGCCG CGGGCACGCC CGCCCGGATC
ATGGGGTGGG GCTCCACGTG TTCGGATCCG GCGAAGATGC GTGATCCCCA GTGTTATCCG
AGCGAGCTCC GCACCGCCGA GACCGCAGTG CAGGATGCCC GCGCCTGCCC GAACATCCAG
GGCTCGGATA CGCCGCGCCC GCTGTGCATC GGCGGCAAGG ACGGGCGGCC GACCATCGGC
AACGCCGATT CCGGCTCGCC GGCCCTGGTG CAGGAGAACG GCGCATGGGT CATCGCGGGC
GTGGTCAGCG GTCCGGGCAC GAACGACGAC AAGGGGCCCG GTCTCTACAT GGACCTCACC
CAGCAGCGGG ACTGGATCGA CAGCATCATC AACAACACGA TCGTCCCCGA CGCGCCGCCG
ACCCCGGACG TCGCCGGCGC GGTGCTCCTG GGCAACTGCA TCGGCTCGAT CGTGCGGCCG
CCCGGGGCCA CTCCGGATGC CCCCGCCATG GTGCTCACCA ATGGGCACTG CGTCAGCGGC
GACCGGCCGG CGCCGGGTGG GGCAACGGTG AACCAGCCCT CGAACCGCAC CATGCTGGCG
GCGGGCCGCA CCGGCGAGTC GGTGACCACG GTGCGCGCCG ATCGTCTCGT CTACGCCACC
ATGAGCCGCA CCGATGTCGC GGTCTACCGC CTCGACTCGA CCTATGCCCA GGTGGCCGCG
CGCGGCGCCA CCGTCTTCGA TCTCGCGACG ACACCCATCC GTCCCGGAGA TCGGTTCTCC
ATGAACACCG GTGCCGCACG CAAGTCCTGT TCGGCCGAGG CCGTGGTACC GACTGTGCGA
GAAGGCGACT GGGAACAGCG AGACTCGGTG CGGTATCGGG ACTGCTCGTC GGTGCCCGGC
GAGTCCGGAT CGCCGCTGAT CTCACCCGAC GGCCGCACCG TGGTCGGCGT CAACAACAGC
TCCAACACCG ACGGCGAGAA GTGCACCGAC GACAACCCGT GCGAGATCGC GGCCGACGGC
ACCGTGACCG CGGTCAAGGG GCGCTCCTAC GGCCAGCAGA TCGACGCGCT CGCACGGTGC
CTGACCCGGG ACTCGATCGA CCTCTCCCGG CCCGGTTGCG ACCTACCAGG TGCGGCCTGA
 
Protein sequence
MRRLVIILIG TALLALAGGP PAGAIIGGKV APEAPPSLGS LQVELPGDAV TPDNHRCGTT 
LIAPQWVVTA SHCAILGNAG DAATGPMRAS VRIGSTNTGT GGELVGVDRF YRLGTPTEAR
EDGYDWLLRD IALLRLERPV RATPIPIASK TPAAGTPARI MGWGSTCSDP AKMRDPQCYP
SELRTAETAV QDARACPNIQ GSDTPRPLCI GGKDGRPTIG NADSGSPALV QENGAWVIAG
VVSGPGTNDD KGPGLYMDLT QQRDWIDSII NNTIVPDAPP TPDVAGAVLL GNCIGSIVRP
PGATPDAPAM VLTNGHCVSG DRPAPGGATV NQPSNRTMLA AGRTGESVTT VRADRLVYAT
MSRTDVAVYR LDSTYAQVAA RGATVFDLAT TPIRPGDRFS MNTGAARKSC SAEAVVPTVR
EGDWEQRDSV RYRDCSSVPG ESGSPLISPD GRTVVGVNNS SNTDGEKCTD DNPCEIAADG
TVTAVKGRSY GQQIDALARC LTRDSIDLSR PGCDLPGAA