Gene Information |
Locus tag | Csal_1527 |
Symbol | |
ID | 4029229 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chromohalobacter salexigens DSM 3043 |
Kingdom | Bacteria |
Replicon accession | NC_007963 |
Strand | - |
Start bp | 1738786 |
End bp | 1740645 |
Gene Length | 1860 bp |
Protein Length | 619 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637966716 |
Product | tetratricopeptide TPR_4 |
Protein accession | YP_573579 |
Protein GI | 92113651 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
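The coordinate and length fields above are mutually consistent, and the arithmetic can be sketched as follows. This is a minimal illustration, assuming the Start bp and End bp values are 1-based and inclusive, and that Protein Length excludes the stop codon. The function names `gene_length_bp` and `protein_length_aa` are hypothetical and are not part of the source database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Csal_1527.
length = gene_length_bp(1738786, 1740645)
print(length)                     # 1860
print(protein_length_aa(length))  # 619
```

The strand (`-`) does not affect the length calculation; it only determines which strand of the replicon carries the coding sequence.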
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.80343 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCGCGAG CGGCCGGGTT CCCGGCCGAA GCGCTTTGCC CCGCCTGGCC GAGCCGGTAT GCTGATGCTT TCCGATGCAA ACTCGAGGAT GGCATGCGCA CACGCTTGAC TCTGGCCACC ACCCTGGCGT TGCTGCTGAC GGGCTGCCAG CACGCGGCGA CCACGCCCGG CGCCATCCCG CTCGAAGACC CCATGGAAAA CGCCCCACCG GTGGCTCGCG GCTTCGACGC CGAAGGGCTC TCGACACTGT TGCTGGCCGA AATAGCCGGC CAGCGTGGCG ACTACCGCCG AGCGGCTCGG GGTTACCTGG AGGTCAATGA ACGCTACGGC TCCGTCGCCC TGGCCGAACG CGCCACGCTC GCGGCGCGTT ATGCCAATGA CAGCGCCCTG CTGGAGCAGG CTGCGCTGCG CTGGCAACGG CTCGCCGACG ATGCCACCAC CCCGCACTAC GTGCTGGCCA GCCTGGCGGT GCAACGCGGC GACTGGGAAA CCGCTCTGGC GCAACGCCTG CGCATCGCCG CTCGGGAGCC CGACGCCGAA CTCGCCGCCC TGGCAGAACT GGCCATCGAG GACGATGCCG ATCTGGCACC GCTGATGGCA GACCTGCACG ACTTCATCGA CGAGCATCCC GACCATCTCG ACGCTCAACT GGCCACGGCA CAACTCGAGG CGGCGCAGGG CGACATCCAG ACAGCCGACG CCCGGCTCGC GCGCCTGTCG GACAACGCCG ACGCGCCTGC CGACGTATGG CTGACCCGCA GCACGATTGC CCTGCACCGT GGCGCATTGA GCGACGCACG TCGTTATGCC AGCCAGGGGC TGCGCCAGTT TCCCGACGAC AGCCGCTTGC GCCTGACGCT GGTTCAGGCA CGCCTGGCCG ACGGCGAGAT CGCCGCCGCC CACGACGATG TCGAACGCCT GCTGGCACGT CACGACGATA CCGCCTCCTT GCGCCTGGCT CTCGCCCAGA TGTATCTCGA CGCCGCGCAA CCCGATGGCG CACGGCGCCT TCTGCTGCCA CTGCTCGACC GCGACGACAC CCCGCCCGCC GCTTACCTGA TGCTGGGCAG CATCGCCGAG CGCGCCGGCG AGACCGACAA CGCGCTTCTT TATTACCGCC AGGTACCCAG CGGCGATGGC TTCATCGAAG CGCGCGCTCA GGCCGCCAGG ATGCTGGTCG CCGACGACCG CCCCCAGGAC GCCAGCGCCT TCCTGCGCAT CGAGCGCCTC CGTCACCCCG ATCAGCGCGT TCCCTTGTTG CGCCTCGAGC TGGACGTGCT CGATGCCCAG GGTCAGCAGG AACGCGCCGA CCGTCTTCTC GACGAGGCCA TCGCCAAGAC GCCCGACGCC ACCGAGCTTC GCTTTCAGCG CGCCATGCGC GCCTATCGCC AGGGCGACCT GCAGGCCATG GAGGCCGATC TGCGGGCAAT CATCGAGCGT GAACCGGACA ATGCCGAGGC ACTCAATGCG CTCGGCTATA CCCTGAGCAA CGACATGGGG CGCCACCGCG ACGCCCTGCC GCTCATCGAA AAGGCGCATC GCCTCGAGCC CGACAGCCCC GCGATCCTCG ACAGCCTCGG CTGGGTCTAC TTTCATCTCG GTCGTGCCGC CGACGCCCTG CCCTATCTGC GCGAGGCCTA TCGCGGCCAG CCCGATCAGG AAATCGCCGC GCATCTCGCC GAGGTGCTCG CTGCTACCGG GCAACGCGAT GACGCGCGCG ACCTGATAGA GGAAGCACAG CGCCGTTTTT CTCCGCATCC GCTCATCGAC GAGCTGTTGC GTCGCCAGCC TGAACTCGCG 
CCGGACGACA CGGCACCGGA TGCCGCCGAT TCGCCTCACG ACAAGGAAGC CGACTCATGA
|
Protein sequence | MSRAAGFPAE ALCPAWPSRY ADAFRCKLED GMRTRLTLAT TLALLLTGCQ HAATTPGAIP LEDPMENAPP VARGFDAEGL STLLLAEIAG QRGDYRRAAR GYLEVNERYG SVALAERATL AARYANDSAL LEQAALRWQR LADDATTPHY VLASLAVQRG DWETALAQRL RIAAREPDAE LAALAELAIE DDADLAPLMA DLHDFIDEHP DHLDAQLATA QLEAAQGDIQ TADARLARLS DNADAPADVW LTRSTIALHR GALSDARRYA SQGLRQFPDD SRLRLTLVQA RLADGEIAAA HDDVERLLAR HDDTASLRLA LAQMYLDAAQ PDGARRLLLP LLDRDDTPPA AYLMLGSIAE RAGETDNALL YYRQVPSGDG FIEARAQAAR MLVADDRPQD ASAFLRIERL RHPDQRVPLL RLELDVLDAQ GQQERADRLL DEAIAKTPDA TELRFQRAMR AYRQGDLQAM EADLRAIIER EPDNAEALNA LGYTLSNDMG RHRDALPLIE KAHRLEPDSP AILDSLGWVY FHLGRAADAL PYLREAYRGQ PDQEIAAHLA EVLAATGQRD DARDLIEEAQ RRFSPHPLID ELLRRQPELA PDDTAPDAAD SPHDKEADS
|
| |
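The sequences above are displayed in space-separated blocks of 10 characters. A minimal sketch of how such a blocked string could be normalized and sanity-checked is given below. The helper names (`clean_sequence`, `gc_fraction`, `looks_like_cds`) are hypothetical, and the start-codon check is a simplification: translation table 11 also permits alternative start codons, which this sketch does not accept.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and uppercase it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def looks_like_cds(seq: str) -> bool:
    """Rough check: length divisible by 3, ATG start, recognized stop codon.

    Simplification: only ATG is accepted as a start codon here.
    """
    return len(seq) % 3 == 0 and seq.startswith("ATG") and seq[-3:] in STOP_CODONS


# First three blocks of the gene sequence, used only as a short demonstration.
fragment = clean_sequence("ATGTCGCGAG CGGCCGGGTT\nCCCGGCCGAA")
print(len(fragment))          # 30
print(gc_fraction(fragment))  # GC fraction of this fragment only
```

Applied to the full gene sequence, `len(clean_sequence(...))` can be compared against the Gene Length field, and `gc_fraction` against the GC content field.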