Gene Clim_1946 details


Gene Information

Locus tag: Clim_1946
Symbol:
ID: 6355001
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2158567
End bp: 2159862
Gene Length: 1296 bp
Protein Length: 431 aa
Translation table: 11
GC content: 55%
IMG OID: 642669544
Product: outer membrane efflux protein
Protein accession: YP_001943957
Protein GI: 189347428
COG category: [M] Cell wall/membrane/envelope biogenesis; [U] Intracellular trafficking, secretion, and vesicular transport
COG ID: [COG1538] Outer membrane protein
TIGRFAM ID:
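
The coordinates and lengths in this record can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record does not state either convention explicitly, and the function names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2158567
END_BP = 2159862
GENE_LENGTH_BP = 1296
PROTEIN_LENGTH_AA = 431


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks hold under the assumptions stated above.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 2159862 - 2158567 + 1 gives 1296 bp, and 1296 / 3 - 1 gives 431 aa, matching the listed values.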


Plasmid Coverage information

Num covering plasmid clones: 38
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTACAGAA TGCTTCTTGC TTGCAGGGTC GGTATCTGTT CGTTGTTGAC GCTGCTTTTT 
TTCTCTGTTC CGAGGCCTCT CCAGGCGGTC GAGCTGCTGA CCTGGCAGCA GTGCGTCAGC
GAAGCCGGAA AAGCCCATCC CGATCTCTAT TCGGCTCTGG CCGCAATGCA GCAGGCCGAA
GCCGACAGCC GCATCACCGG TGGACAGCTT CTTCCACAGT TGAGTGCTGG TTTGACGGCC
GTGCAGAACG GCACTACCGA ACAAAACATC GGTTCCTCAT CGGCATTTTC CTGGTCGCTG
AGTGCAAGGC AGCTGCTCTA TGACGGTCGG AAAACCTCCC GACAGGTTGC TGCGTACAAA
GAGGCTGAGA AAGCGGCGGG CCACAATTAC AGTGCGGTTT CTGCCGATGT GCGATTCGCC
CTCCGTTCGG CATTTACCGA TCTTCTCAAA GCCGAGGAAC TGGTTGCTCT TTCAAAGGAG
ATTGCCGAAC GCAGAAGGAA GAATTTCCGG CTCATCGGTC TTCGCTATCA GGCCGGAAGA
GAACATATCG GTTCGCTCCG CAAGGCCGAG GCCGATCTCG GCGAGTCGGA GTTTGAAGTC
TCACGGGCCG AAAGGGGGCT TGTGCTTGCA CAGTCCATAC TCGCTTCGGC TCTCGGGCGC
GATCTGCGCA GTCCGATCAG GGTAAAGGGC TCGTTCACCG TCGAAGATGT TCTCACGGTA
AAGCCGAATC TTGCCCTGCT TGCCAGAGAA AGTCCGCTGG TACAGCAGCT CGAAGCCAGG
CGAAAAGCCG CCCGTTACGA TCTCGAAGCC GCAAAAGGGG CATTCTCTCC CGAACTATCC
CTGACCTCTT CTATCGGCAG AAATTCCTTT GACAGCCTGC CTCCCGACGA GGTCGACTGG
AATGCCGGAA TCGAGCTGTC TTTGCCGATA TACGGCGGCG GCACGGGCCG TGCCAAAGTC
GCGCGGGCCA TGGCGGTTGT CAGTCAGCAG AATGCCGAAG AGAAAAGCGG CTACCTTCAG
GTGTTCGATA TGCTGGAAGA GAGCTGGAAG AATTTTCTGG ATGCCCGTCA GTACGTTACC
GTGCGGAGAA AATTTCTCGA TGCGGCCGTT GAACGCGCAG CTATTGCCAA TGTCCAGTAT
TCGAACGGTC TTATCTCTTT CGACGACTGG GTGATAATAG AAGACAATCT TGTGAGCGCA
AAAAAGGAGT TTCTCAATGC CGGAGCAGAT CTCTTGATTG CCGAGGCTCA ATGGATCAAG
GCAAAAGGGG GAGGACTTGA TGATCAGCAG AAGTAA
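
The gene sequence above is printed in 10-base blocks, so the whitespace has to be stripped before any analysis. The following is a minimal sketch of how the sequence could be cleaned and its GC fraction computed for comparison with the GC content listed in the record. The helper names are hypothetical, and only the first and last lines of the sequence are embedded here; paste the full block into `gc_fraction` to check the whole gene.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in a (possibly block-formatted) sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from this record.
first_line = "ATGTACAGAA TGCTTCTTGC TTGCAGGGTC GGTATCTGTT CGTTGTTGAC GCTGCTTTTT"
last_line = "GCAAAAGGGG GAGGACTTGA TGATCAGCAG AAGTAA"

# Basic sanity checks: the CDS opens with ATG and closes with a TAA stop codon.
print(clean_sequence(first_line).startswith("ATG"))
print(clean_sequence(last_line).endswith("TAA"))

# GC fraction of a fragment; pass the full sequence text to get the gene-wide value.
print(round(gc_fraction(first_line), 3))
```

The layout is also consistent with the stated gene length: 21 full lines of 60 bases plus a final line of 36 bases add up to 1296 bp.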
 
Protein sequence
MYRMLLACRV GICSLLTLLF FSVPRPLQAV ELLTWQQCVS EAGKAHPDLY SALAAMQQAE 
ADSRITGGQL LPQLSAGLTA VQNGTTEQNI GSSSAFSWSL SARQLLYDGR KTSRQVAAYK
EAEKAAGHNY SAVSADVRFA LRSAFTDLLK AEELVALSKE IAERRRKNFR LIGLRYQAGR
EHIGSLRKAE ADLGESEFEV SRAERGLVLA QSILASALGR DLRSPIRVKG SFTVEDVLTV
KPNLALLARE SPLVQQLEAR RKAARYDLEA AKGAFSPELS LTSSIGRNSF DSLPPDEVDW
NAGIELSLPI YGGGTGRAKV ARAMAVVSQQ NAEEKSGYLQ VFDMLEESWK NFLDARQYVT
VRRKFLDAAV ERAAIANVQY SNGLISFDDW VIIEDNLVSA KKEFLNAGAD LLIAEAQWIK
AKGGGLDDQQ K