Gene Cag_1169 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCag_1169 
Symbol 
ID3747925 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium chlorochromatii CaD3 
KingdomBacteria 
Replicon accessionNC_007514 
Strand
Start bp1563540 
End bp1564874 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content47% 
IMG OID637773703 
ProductTPR repeat-containing protein 
Protein accessionYP_379474 
Protein GI78189136 
COG category[N] Cell motility
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG3063] Tfp pilus assembly protein PilF 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAACCAT TTTTCACTTT TTTACGCTAT GCCTTCATTG GTATTTTAAC GCTTTCCCTT 
GTAACCCCTG AGGTTGTAGA TGCTGCTAAA AAATCCAAAA AGAAAAGCAG TAGCCGTAAA
AAATCATCCA AACGTAACGC ACGCGCTAAA AAAGGCTCGA ATAAAAAAAC ATCAGCGCGG
CAAGCTCGTT TACGCGTGGT TGATGGGGTA GAAACGGAAC GAAATTCCAT AAATCTTACT
GCCTCGCCAT CAAGTGCTTC GCGTCAGCTC AATAAGCGAG CAATGGGATT TTATGAGCAA
GGGCGTTATG CTGAAGCTGA GCCACTCTAT CGAGAATTAC TTACTCTTGA TGAAAAACAG
TTAGGCAGTC GCCATCCCGA AGTTGCAGTT ACCTTAAACA ACCTTGCTTC ACTCTTGCAG
CAACAAGGGC GATATAACGA AGCCGAGCCA CTCTATCGCC GTGCGCTCTC TATTCGCGAA
GAAAATTTTG GGGCTGACGA TGCAAGTGTA GCGCAAAGCT TAAACAATCT TGGCTCGCTC
TTGCAAGATC AAGGACGTTA CTATGAAGCA CGTCAGCTTT ATAGCCGCTC ACTTGCAATT
GATGAAAAAG TGTTGGGAAC CGACCATCCC GATGTTGCCG CCGACCTCAA CAACCTTGCC
TCACTACTAC AAGCACAAGG GCGTTATGCC GAAGCTGAGC CGCTCTATCG CCGTTCCTTA
GCCATTCGTG AGCAACGATT TGGTGCAGAG CATACGCTGG TTGCTATGAG CCTCAATAAT
CTTGGCGTGC TCTTGCAAGC ACAAGGGCGT TATAGTGAAG CCGAGCCACT CTATCGCCGC
TCGCTTGCCA TTCGTGAGGC TCAATACCCC GCCAACAACC ACTCAATTGT TGCAACAAGT
CTCAATAATC TTGCCTCCCT TTTGCAGGCA CGAGGAAAAC TTACTGAAGC TGAACCCATT
TATCAGCGCG CATTGTCCAT CAACGAACAA ACCTTAGGTG AAAACCACCC ATCAGTTGCA
ACAAGCCTCA ATAATCTTGC TGGGTTGCTT AGGGCACAAG GGCGATATGC CGATGCTGAA
CCTCTTTACC GCCGCTCGTT AACAATACGT GAAGAACAGC TTGGCGAAAA CCACCCCGAT
GTTGCTATGA GCCTCAATAA TCTTGGAGTG CTCTTGCAAG CACAAGGGCG TGCCAGCGAA
GCCGAACCAC TCTATCGCCG AGCATTACTG ATTGATGAAA AAGTATTAGG AGCTACGCAC
CCACAAACAA TCCGTTTACG CAATAATCTG AATGCTTTAC TGAATCCATC AGCAATACCA
CTAACCACCC AATAA
 
Protein sequence
MKPFFTFLRY AFIGILTLSL VTPEVVDAAK KSKKKSSSRK KSSKRNARAK KGSNKKTSAR 
QARLRVVDGV ETERNSINLT ASPSSASRQL NKRAMGFYEQ GRYAEAEPLY RELLTLDEKQ
LGSRHPEVAV TLNNLASLLQ QQGRYNEAEP LYRRALSIRE ENFGADDASV AQSLNNLGSL
LQDQGRYYEA RQLYSRSLAI DEKVLGTDHP DVAADLNNLA SLLQAQGRYA EAEPLYRRSL
AIREQRFGAE HTLVAMSLNN LGVLLQAQGR YSEAEPLYRR SLAIREAQYP ANNHSIVATS
LNNLASLLQA RGKLTEAEPI YQRALSINEQ TLGENHPSVA TSLNNLAGLL RAQGRYADAE
PLYRRSLTIR EEQLGENHPD VAMSLNNLGV LLQAQGRASE AEPLYRRALL IDEKVLGATH
PQTIRLRNNL NALLNPSAIP LTTQ