Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Csal_1662 |
Symbol | |
ID | 4029124 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | + |
Start bp | 1890949 |
End bp | 1891977 |
Gene Length | 1029 bp |
Protein Length | 342 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637966851 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_573714 |
Protein GI | 92113786 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.061862 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCGTATC CCCTGTTTTC GTCCGGATCG GTGCGCCGAC CGACATTGCG TCCGGGCCGC CTGGTGGCCG GGTTCGTCTG CCTGGCGGCA CTGGCGGGCT GCGCCGGCCC CGATACGCCG AATACCGCCT GGACGCAAAC CGATGGCCGC GCCAACGGCA TCGCCACCGC CAACGAAGAG GACTCGACGA CCCGGCGCGG CGCCTGCGGC GAACCCATGA GCGCCACCAC CGGCATGCAG ATCACGCTGA TCCGCCAGAT GCTCGACGAC GACAAGCCGC GCGCGGCGCT GGCGCACCTG GACAGCCTCT CGCTGGAGGC CACCGGCGCC GATCTCGCCG AGCCGCGACT GCTGCGCGGC GAAGCGCTGC GCCGCACGGG CAAGCGCGAC GCCGCCGACC GTATCTACGA CAGCCTCACC GCCACCTGCC TCGCCGCCGA TGCCTGGCGG GGCATCGCCC GCAACCAGGC CGCGCGGGGC GACATGGCCA CCGCGCTGGC TTCCATGCGC AAGGCACGCG AGTCGCGTCC CATCGACGCC GCCATTCGCA ACGACCTGGG CTACCTGCAC TTGCTGGAGG GCCAGCCGCG TCAGGCCGAG GAAGAATTCC TCACCGCGCT GGAGCTCTCG CCGAACTACG CTCGCGCCGC CCGCAACCTG ATCATGGCGC TGTATGCCCA GGGCAACGCA CGCCAGGCCA CCCGCATCGC CGAGCGCTAC AGCATCGGCC AGGACGACCT CGAGCTGCTG CGGCAGACGC GTCTGGCCTC GAATGCCACG GCGTCACCCG CGACCGCGAG GTCGGGTCCG ACCGCGGCCT CGGCATCGGC GACGTCGGTG TCGGCGACAA CGTCCGTCCC GGCTTCGGTC ACCGACGCGA TCACCGCCAG ACGCAGCGGC ACCTCGGCAC GCGCGTCCGG CGGCACGACG GCGACACGCA ACGAGGCCGA GACGAGCGGC GCCATCACCC ACACGCCCAT CGCCATGCCC GACAACACTG CCTTCGAGCT GGAAATCACC CGCAAATGA
|
Protein sequence | MPYPLFSSGS VRRPTLRPGR LVAGFVCLAA LAGCAGPDTP NTAWTQTDGR ANGIATANEE DSTTRRGACG EPMSATTGMQ ITLIRQMLDD DKPRAALAHL DSLSLEATGA DLAEPRLLRG EALRRTGKRD AADRIYDSLT ATCLAADAWR GIARNQAARG DMATALASMR KARESRPIDA AIRNDLGYLH LLEGQPRQAE EEFLTALELS PNYARAARNL IMALYAQGNA RQATRIAERY SIGQDDLELL RQTRLASNAT ASPATARSGP TAASASATSV SATTSVPASV TDAITARRSG TSARASGGTT ATRNEAETSG AITHTPIAMP DNTAFELEIT RK
|
| |