Gene Clim_0478 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0478 
Symbol 
ID6354473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp541170 
End bp542201 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content51% 
IMG OID642668109 
ProductTetratricopeptide TPR_2 repeat protein 
Protein accessionYP_001942550 
Protein GI189346021 
COG category[R] General function prediction only 
COG ID[COG4785] Lipoprotein NlpI, contains TPR repeats 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones41 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAACTC GCATTCTCCC TTTTTTTCTC TTTTTTTTTG TATTTGTTCT CCCCGTTGTT 
TCTGCTGCAG ACGACTTCGA TGCTCTGTTT GTGCAAGCCG GGATGAAATA CCGGAAAGGC
GATTTTCAGG GCGCATCGGC TCTGTATACT GCCGTGCTGA GCGGCAATCC GCGTTCTGCG
GAAGCGTATA ACAACCGCGG GCTCTGTAAA GCAGCTTCAG GAGATGTCAC CGGCGCTGTT
GCTGACTATT CCGAAGCCCT TAAACTCGAT CCGTCCCTGG CGGCGGCCTC CAATAACAGG
GGCCTCGCAA TGGCAAAGAT CGGGAAGTAT CATGAAGCTG TTCTCGATTA TAATCAGGCC
CTCCGTATCA ATGCCGTTCT GCCTGAAGTG TACAACAATC TCGGATTGGC CAGAATCGCA
TTGGGAGATC AATCAGGAGC ACTCGACGAT TTCAATACGG CCCTTGCGCT TAAACCTTTT
TATCCCGAAG CGCTTTTTAA CAGGGGGTGT GCCCGGCAGA AGCTGTCAGA ACACCGGGAA
GCTCTTCGGG ACTTTCAACA GGTCATATCC TTCAGATCGG GATATGCCGA GCCTTATTTT
TATGCTGCGC TTTCACGTTC TGCTATGGGG GATCACAAGG GCGCTCTCGT AGATTATACA
AAAGCGATTG CCATTTCTCC CTCATACGCG GAAGCTTTTG CAGGCAGAGC GCTTGCGAAG
ATCAGAAGCG GTGATTATCG CGGGGCTCTC GACGATTACG ATACGGTGAT AGGGCTGCAG
TCTGATAATC CGGAACTTTA CTATAATCGG GCGCTGGTCA AGGTCAAGCT GTCTGACTAT
CCGGGAGCTG AAATTGACTG TTCACTCGCT CTCGAACGGA ACAAGGTATA TGCCGAAGCT
TTTTTTCTCA GGGGTATCGT TCGGAGTGAA CTTGGAAACC GCGAGGGTAT GCTTGCCGAT
TTGCGTTTTT CTGCAGATGC AGGTTATGAG CCGGCAAAGA AGCTGCTGAA AAAAGAACGG
GACAGGAGAT AG
 
Protein sequence
MKTRILPFFL FFFVFVLPVV SAADDFDALF VQAGMKYRKG DFQGASALYT AVLSGNPRSA 
EAYNNRGLCK AASGDVTGAV ADYSEALKLD PSLAAASNNR GLAMAKIGKY HEAVLDYNQA
LRINAVLPEV YNNLGLARIA LGDQSGALDD FNTALALKPF YPEALFNRGC ARQKLSEHRE
ALRDFQQVIS FRSGYAEPYF YAALSRSAMG DHKGALVDYT KAIAISPSYA EAFAGRALAK
IRSGDYRGAL DDYDTVIGLQ SDNPELYYNR ALVKVKLSDY PGAEIDCSLA LERNKVYAEA
FFLRGIVRSE LGNREGMLAD LRFSADAGYE PAKKLLKKER DRR