Gene Clim_2043 details


Gene Information

Locus tag: Clim_2043
Symbol:
ID: 6355548
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2253203
End bp: 2254252
Gene Length: 1050 bp
Protein Length: 349 aa
Translation table: 11
GC content: 50%
IMG OID: 642669639
Product: OmpA/MotB domain protein
Protein accession: YP_001944051
Protein GI: 189347522
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG2885] Outer membrane protein and related peptidoglycan-associated (lipo)proteins
TIGRFAM ID:
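
The start, end, gene length, and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names gene_length and protein_length are hypothetical helpers, not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminating stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(2253203, 2254252)
print(length)                  # 1050
print(protein_length(length))  # 349
```

Both results agree with the Gene Length (1050 bp) and Protein Length (349 aa) fields of this record.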


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.0433092
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACATC AGGATTCACG TTCGCGTTCC ATATTCTGCC GGATGGGCAG GATTCTTATG 
TTTTTGCCGC TTTTTGTGCT TTTTTTGGCA GGAACGGATC TTTCAGCAGC CGATCTTCCG
GGTTCGAAAG ACAATCCGTT GCTGAAGCGG TTTGCCGGTT CGGAAATTGT CGGTTATCAC
GCAAAGAGCT TCGATGAATA CGAGCTTCAG ACCTCTACGT TCATTCGTTA CAATTTCGAA
ACCAGAAAAC GGGATTATGC AAAACCGCCG CTTAAACCGG AAGGCCGGCT GACGAGAATC
TGGTACGAGG CGGCCGGAGA TACCGGTTCG CTGGAAGTTT ACCGGAATTA TCTCAATGAA
CTGCGATCGA ATGGCTTCGT CATTCTCTAT GATTCCAAAA AAGATCCCGC GGCGACAAAA
TGGACGAACT ACCTTGCTCC TTTCGGATCT GTCGATCTTA CCACCAACAG AAGCAAGTAT
GTTTTTTTCG CTGCCGAGAA AAACGCTATC TGTGTTGCAA GCGCCAAAAA GAAGCGGCCT
GAAGGGGATG TTTATGTTTA TCTGACCGTT ATCGAATGGG GAAAGGATGA TTCGGTCTAT
AAGGCCAGAC GCGGAGCCTA TGCGGCGGTC GATATCATCG AAACCAGGCC AATGCAGCAG
AAAATGGTTA CGGTTTCTGC AGATGAAATG TCGCGCTCCA TCACTTCGAC CGGCAAGGTC
TCTCTTTACG GCATTTATTT CGATACCAAC AAGGCGGATA TAAAACCGGC CTCGAAACCA
GCTCTCGGGG AGATCGCAAA ACTTTTGAAG AAACAGCCGG CAATGAAGCT TCATGTTGTA
GGCCATACCG ACAATGCCGG TGGCTACGAA TTCAATGTAT CGCTTTCGAA ACGCAGGGCC
GATGCAGTGG TCGGTGTGCT GCAGAAAGAG TATGGTATCG CTCCCGGTCG CCTGACCGCC
AATGGTGTGG CCTATCTCGC TCCCGTTGCT TCCAATGCGG CTGAAGCCGG AAGGGCGAAA
AACCGTCGCG TCGAACTGGT GCCGAGATAA
 
Protein sequence
MKHQDSRSRS IFCRMGRILM FLPLFVLFLA GTDLSAADLP GSKDNPLLKR FAGSEIVGYH 
AKSFDEYELQ TSTFIRYNFE TRKRDYAKPP LKPEGRLTRI WYEAAGDTGS LEVYRNYLNE
LRSNGFVILY DSKKDPAATK WTNYLAPFGS VDLTTNRSKY VFFAAEKNAI CVASAKKKRP
EGDVYVYLTV IEWGKDDSVY KARRGAYAAV DIIETRPMQQ KMVTVSADEM SRSITSTGKV
SLYGIYFDTN KADIKPASKP ALGEIAKLLK KQPAMKLHVV GHTDNAGGYE FNVSLSKRRA
DAVVGVLQKE YGIAPGRLTA NGVAYLAPVA SNAAEAGRAK NRRVELVPR
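
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation sketch. It assumes that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two tables differ in which start codons are permitted), and it builds the codon table from the usual TCAG ordering. The helpers clean, translate, and gc_fraction are hypothetical names introduced for illustration; only the first and last 30 bases of the gene sequence above are used as input.

```python
from itertools import product

# Codon table in TCAG order; amino-acid assignments shared by the standard
# code and, by assumption here, translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spacing used in the 10-base display blocks and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    return (dna.count("G") + dna.count("C")) / len(dna)


first_30 = clean("ATGAAACATC AGGATTCACG TTCGCGTTCC")
last_30 = clean("AACCGTCGCG TCGAACTGGT GCCGAGATAA")

print(translate(first_30))  # MKHQDSRSRS
print(translate(last_30))   # NRRVELVPR*
```

The two outputs match the beginning and end of the protein sequence listed above, with the trailing * corresponding to the TAA stop codon. Passing the full cleaned gene sequence to gc_fraction would give the value summarized by the GC content field.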