Gene Csal_1527 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCsal_1527 
Symbol 
ID4029229 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChromohalobacter salexigens DSM 3043 
KingdomBacteria 
Replicon accessionNC_007963 
Strand
Start bp1738786 
End bp1740645 
Gene Length1860 bp 
Protein Length619 aa 
Translation table11 
GC content68% 
IMG OID637966716 
Producttetratricopeptide TPR_4 
Protein accessionYP_573579 
Protein GI92113651 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG5010] Flp pilus assembly protein TadD, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.80343 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTCGCGAG CGGCCGGGTT CCCGGCCGAA GCGCTTTGCC CCGCCTGGCC GAGCCGGTAT 
GCTGATGCTT TCCGATGCAA ACTCGAGGAT GGCATGCGCA CACGCTTGAC TCTGGCCACC
ACCCTGGCGT TGCTGCTGAC GGGCTGCCAG CACGCGGCGA CCACGCCCGG CGCCATCCCG
CTCGAAGACC CCATGGAAAA CGCCCCACCG GTGGCTCGCG GCTTCGACGC CGAAGGGCTC
TCGACACTGT TGCTGGCCGA AATAGCCGGC CAGCGTGGCG ACTACCGCCG AGCGGCTCGG
GGTTACCTGG AGGTCAATGA ACGCTACGGC TCCGTCGCCC TGGCCGAACG CGCCACGCTC
GCGGCGCGTT ATGCCAATGA CAGCGCCCTG CTGGAGCAGG CTGCGCTGCG CTGGCAACGG
CTCGCCGACG ATGCCACCAC CCCGCACTAC GTGCTGGCCA GCCTGGCGGT GCAACGCGGC
GACTGGGAAA CCGCTCTGGC GCAACGCCTG CGCATCGCCG CTCGGGAGCC CGACGCCGAA
CTCGCCGCCC TGGCAGAACT GGCCATCGAG GACGATGCCG ATCTGGCACC GCTGATGGCA
GACCTGCACG ACTTCATCGA CGAGCATCCC GACCATCTCG ACGCTCAACT GGCCACGGCA
CAACTCGAGG CGGCGCAGGG CGACATCCAG ACAGCCGACG CCCGGCTCGC GCGCCTGTCG
GACAACGCCG ACGCGCCTGC CGACGTATGG CTGACCCGCA GCACGATTGC CCTGCACCGT
GGCGCATTGA GCGACGCACG TCGTTATGCC AGCCAGGGGC TGCGCCAGTT TCCCGACGAC
AGCCGCTTGC GCCTGACGCT GGTTCAGGCA CGCCTGGCCG ACGGCGAGAT CGCCGCCGCC
CACGACGATG TCGAACGCCT GCTGGCACGT CACGACGATA CCGCCTCCTT GCGCCTGGCT
CTCGCCCAGA TGTATCTCGA CGCCGCGCAA CCCGATGGCG CACGGCGCCT TCTGCTGCCA
CTGCTCGACC GCGACGACAC CCCGCCCGCC GCTTACCTGA TGCTGGGCAG CATCGCCGAG
CGCGCCGGCG AGACCGACAA CGCGCTTCTT TATTACCGCC AGGTACCCAG CGGCGATGGC
TTCATCGAAG CGCGCGCTCA GGCCGCCAGG ATGCTGGTCG CCGACGACCG CCCCCAGGAC
GCCAGCGCCT TCCTGCGCAT CGAGCGCCTC CGTCACCCCG ATCAGCGCGT TCCCTTGTTG
CGCCTCGAGC TGGACGTGCT CGATGCCCAG GGTCAGCAGG AACGCGCCGA CCGTCTTCTC
GACGAGGCCA TCGCCAAGAC GCCCGACGCC ACCGAGCTTC GCTTTCAGCG CGCCATGCGC
GCCTATCGCC AGGGCGACCT GCAGGCCATG GAGGCCGATC TGCGGGCAAT CATCGAGCGT
GAACCGGACA ATGCCGAGGC ACTCAATGCG CTCGGCTATA CCCTGAGCAA CGACATGGGG
CGCCACCGCG ACGCCCTGCC GCTCATCGAA AAGGCGCATC GCCTCGAGCC CGACAGCCCC
GCGATCCTCG ACAGCCTCGG CTGGGTCTAC TTTCATCTCG GTCGTGCCGC CGACGCCCTG
CCCTATCTGC GCGAGGCCTA TCGCGGCCAG CCCGATCAGG AAATCGCCGC GCATCTCGCC
GAGGTGCTCG CTGCTACCGG GCAACGCGAT GACGCGCGCG ACCTGATAGA GGAAGCACAG
CGCCGTTTTT CTCCGCATCC GCTCATCGAC GAGCTGTTGC GTCGCCAGCC TGAACTCGCG
CCGGACGACA CGGCACCGGA TGCCGCCGAT TCGCCTCACG ACAAGGAAGC CGACTCATGA
 
Protein sequence
MSRAAGFPAE ALCPAWPSRY ADAFRCKLED GMRTRLTLAT TLALLLTGCQ HAATTPGAIP 
LEDPMENAPP VARGFDAEGL STLLLAEIAG QRGDYRRAAR GYLEVNERYG SVALAERATL
AARYANDSAL LEQAALRWQR LADDATTPHY VLASLAVQRG DWETALAQRL RIAAREPDAE
LAALAELAIE DDADLAPLMA DLHDFIDEHP DHLDAQLATA QLEAAQGDIQ TADARLARLS
DNADAPADVW LTRSTIALHR GALSDARRYA SQGLRQFPDD SRLRLTLVQA RLADGEIAAA
HDDVERLLAR HDDTASLRLA LAQMYLDAAQ PDGARRLLLP LLDRDDTPPA AYLMLGSIAE
RAGETDNALL YYRQVPSGDG FIEARAQAAR MLVADDRPQD ASAFLRIERL RHPDQRVPLL
RLELDVLDAQ GQQERADRLL DEAIAKTPDA TELRFQRAMR AYRQGDLQAM EADLRAIIER
EPDNAEALNA LGYTLSNDMG RHRDALPLIE KAHRLEPDSP AILDSLGWVY FHLGRAADAL
PYLREAYRGQ PDQEIAAHLA EVLAATGQRD DARDLIEEAQ RRFSPHPLID ELLRRQPELA
PDDTAPDAAD SPHDKEADS