Gene Clim_1842 details


Gene Information

Locus tag: Clim_1842
Symbol:
ID: 6355183
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2019415
End bp: 2020554
Gene Length: 1140 bp
Protein Length: 379 aa
Translation table: 11
GC content: 53%
IMG OID: 642669446
Product: polysaccharide export protein
Protein accession: YP_001943860
Protein GI: 189347331
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG1596] Periplasmic protein involved in polysaccharide export
TIGRFAM ID:
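
The coordinate and length fields in this record are mutually consistent, and the relationship can be checked with a few lines of Python. This is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names `gene_length` and `protein_length` are hypothetical helpers, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2019415
end_bp = 2020554


def gene_length(start, end):
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1140
print(protein_length(length_bp))  # 379
```

Both results agree with the Gene Length (1140 bp) and Protein Length (379 aa) fields.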


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.185865
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAGTACT ACAGCAGCGC CACAGCCCGA AGAAAACACT CGACACGCAC TGCCCGTCCC 
CTGGTGATCA TCTGCGCCGC CATGCACCTG CTGCTCCTGG CCGCCTGCGG CAGCATATCT
CCGAAAGAAA CCACACAATA CGCTCCTCCG GAAAAAGAAT TCAAGGCAGA CATTCCGAAA
AAAACGCAGG AATTCGCCAA ACCCTCGACC ATAAGAGATC TCACCCCGAT CGAACAGTTC
AGTTACCGGC TCGGCCCTGG AGACATCCTC AGCGTGCAGG TATGGAGAAG ACCGGAGCTT
TCACAAGAAA ACATCATGGT CTCGCCCGAC GGCAACATCG CCATTCCGAG AATCGGCAAC
ATGAACGTGC TCAACCGAAC ACCGGCCGAA ATACAAAAAC TTATCACCGC CCGGCTCGAA
GTGCTCTACA TCAGGCCGGA AATAACCGTT CGAGTCCAGG AATTTCACAA CAATAAAGCT
TTCGTCCTGG GACGAGTCAC CAAACCCGGC GTCGTGAACT TCCCCGGCAG AGGCACCCTG
CTCGAAGCGC TCGCACTCGC CGGCGGACTA CCCTATCAGG GTAAAGAAAC CTTCCTCACC
AAATGCGCCA TCATCCGGGG CAACGATATC GTCATATGGA TCGACCTGCA GGATCTCCTC
AAAAACGGAA ATATGGCGCT CAACGCATCC ATCATGAACA ACGACGTCAT CTTCATTCCC
GAAGCTGAAG ATGAAATGAT CTACGTCATG GGAGAGGTCA TCACCCCCGG TGCCATACAG
CTGAAAAGCA GCATGAACGT ACTTAAAGCC ATCATGCTGG CCGGGGGCAT GAACAAGCAC
GCAAACCCCG AAAAAATCTT CATCATCCGC CAGCAGGACC TCAAAGGAAA CGTCATCAGG
GTAAACCTGA AAAATCTGCT CGAAAAGGGC GACTTCGCCA AAAACTATAC CCTTCTGCCT
GAAGACATCG TCTTCGTCAG CCCGAGCGGC ATGGCAAAAT TCAACTACAC CCTCGAAAAA
CTCATCCCGG CGCTGCAGGT GCTCAACCTC GGTATCGACA ACTTCGAATC ATTCGGCCTC
ATGCAGGAAT TGCGCAGAAA GCTCTGGGGA CAGGAAGGTT TCGTCAATTC CAGCGAATGA
 
Protein sequence
MQYYSSATAR RKHSTRTARP LVIICAAMHL LLLAACGSIS PKETTQYAPP EKEFKADIPK 
KTQEFAKPST IRDLTPIEQF SYRLGPGDIL SVQVWRRPEL SQENIMVSPD GNIAIPRIGN
MNVLNRTPAE IQKLITARLE VLYIRPEITV RVQEFHNNKA FVLGRVTKPG VVNFPGRGTL
LEALALAGGL PYQGKETFLT KCAIIRGNDI VIWIDLQDLL KNGNMALNAS IMNNDVIFIP
EAEDEMIYVM GEVITPGAIQ LKSSMNVLKA IMLAGGMNKH ANPEKIFIIR QQDLKGNVIR
VNLKNLLEKG DFAKNYTLLP EDIVFVSPSG MAKFNYTLEK LIPALQVLNL GIDNFESFGL
MQELRRKLWG QEGFVNSSE
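
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is an illustration under stated assumptions, not the database's own pipeline: it builds the codon table from the compact NCBI-style amino acid string, whose codon assignments are shared by the standard code and translation table 11. Table 11 differs in which codons may act as start codons; since this gene begins with ATG, no special start-codon handling is included. The helper names `clean`, `translate`, and `gc_percent` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the matching amino acids.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq):
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna):
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * (dna.count("G") + dna.count("C")) / len(dna)


# First row of the gene sequence above.
first_row = "ATGCAGTACT ACAGCAGCGC CACAGCCCGA AGAAAACACT CGACACGCAC TGCCCGTCCC"
print(translate(first_row))  # MQYYSSATARRKHSTRTARP
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the Protein Length and GC content fields to be checked in the same way.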