Gene Cpha266_1746 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1746 
Symbol 
ID4571108 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1972472 
End bp1973812 
Gene Length1341 bp 
Protein Length446 aa 
Translation table11 
GC content50% 
IMG OID639766329 
ProductTolC family type I secretion outer membrane protein 
Protein accessionYP_912187 
Protein GI119357543 
COG category[M] Cell wall/membrane/envelope biogenesis
[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG1538] Outer membrane protein 
TIGRFAM ID[TIGR01844] type I secretion outer membrane protein, TolC family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAGAGTT TTTGGTATGT AATTCTTATA GCAGGCAGTT TTGTTTCGCC GCCTCTTTAT 
GCCGCGCCAG TCACTATATC GGAAGCTTAT CTGAAAGCCC GAGATCATGA TGCCCTGCTG
GGTGCGGCAA AGGCTGATAA TCTGGTGTAT AAAGAGGAGG TCGGCAAGGC GAGGGCCGGT
TTACGCCCCA GTGTACGTCT GAACGCCTCA CGCGGCCGCA ACGGCACACA GCATGGTTAC
ATGGGACTAT ATGAGGCGCC TGATTTTTAT AATACGGTCG TCTATGGTGT GACGCTTCGT
CAACCATTGA TTAATCTGTC GAATATAGCT GAATACCAGC AGGCGAAAGC TGTGGCGGCA
AAAAGTGATG CTGAATTGCG GAAAGAAGAG GCTGATCTGA TTGTCAGACT TGCCGAGACC
TATTGTAACG CATTGTATGC CGAGGATAAC CTTACCTTCA GTCAGGCTCA TATCAAGGCT
TCGGAAGAAC AGCTTCAGCA GGCGAAACGA CGGTTTGAAA AAGGGTTCGG AACCATTACC
GAAATAAATG AAGCGCAGGC TGACCTCGAT ATGGCGCTTG CTGAGGGACT TGAAATCGTC
AACAGCGTCG AGTTCAGCCG CCGTGAGCTG GAGCATCTTA TCGGAACGTA TCCTGATGAA
CTCTGCAAGT TGGCGCCTGA AAAACTTGTG TTGGCCCGAC CGACGCCAGG TCGTGTGGAA
TCTTGGATTG ATCGAGCTCA TGGAGAGAGT CCTCGAAGCT CGGCAGCGCA CCTTGAGATG
CAGATCGCAA AAAAAGAGAT CGAAAAACAG AAGGCAAGCC GTTATCCGAC GATTGATCTT
GTTGCCGGGA GAAGCTATTC CGAGAGTGAA AATAATTATT CAATCGGCTC AACCTATGAA
ACCTATTCGA TCAGTCTGCA AATGAGCATG CCGATTTATA CCGGTGGTTA TGCCAGTGCA
TCGATCCGGC AGGCAAAAGC AAAGTGGCTT AAAGCGGGTG AGCAGTTTTT CTGGCAGGAA
CGGAGCATCG AATCTGAAGT ACGCAAGTAC TATAATACGG TTATCAGTAC GATCGCACAG
ATCCAGGCCT ATGAACAGGC GGTCAAATCC CGGGAAATTG CTCTCGACGG CACAAAAAAA
GGGTTTGGTG CTGGTCTGCG CAGTAATGTG GATGTTCTTG ACGCTCTTCA GAATCTTCTT
GCCGCCAGAC GTAATCTTGC GAAGTCGCGA TACCAGTATA TTCTTGCCCG TCTTTCGCTC
AAGCAGACCG CAGGAGCATT GTCGCCCGCT GATATTGAGG AGATCAATGG CTGGTTTGCA
ACGGCAAAAA CAGCGAAATA G
 
Protein sequence
MKSFWYVILI AGSFVSPPLY AAPVTISEAY LKARDHDALL GAAKADNLVY KEEVGKARAG 
LRPSVRLNAS RGRNGTQHGY MGLYEAPDFY NTVVYGVTLR QPLINLSNIA EYQQAKAVAA
KSDAELRKEE ADLIVRLAET YCNALYAEDN LTFSQAHIKA SEEQLQQAKR RFEKGFGTIT
EINEAQADLD MALAEGLEIV NSVEFSRREL EHLIGTYPDE LCKLAPEKLV LARPTPGRVE
SWIDRAHGES PRSSAAHLEM QIAKKEIEKQ KASRYPTIDL VAGRSYSESE NNYSIGSTYE
TYSISLQMSM PIYTGGYASA SIRQAKAKWL KAGEQFFWQE RSIESEVRKY YNTVISTIAQ
IQAYEQAVKS REIALDGTKK GFGAGLRSNV DVLDALQNLL AARRNLAKSR YQYILARLSL
KQTAGALSPA DIEEINGWFA TAKTAK