Gene Clim_0142 details


Gene Information

Locus tag: Clim_0142
Symbol:
ID: 6356107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 156701
End bp: 157966
Gene Length: 1266 bp
Protein Length: 421 aa
Translation table: 11
GC content: 49%
IMG OID: 642667764
Product: hypothetical protein
Protein accession: YP_001942220
Protein GI: 189345691
COG category:
COG ID:
TIGRFAM ID:
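
The gene and protein lengths listed above follow from the start and end coordinates. The sketch below reproduces that arithmetic. It assumes 1-based inclusive coordinates and a single terminal stop codon that is not counted as a residue. The variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 156701
end_bp = 157966

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends in one stop codon that is not counted as a residue.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints "1266 421"
```

Both results agree with the Gene Length and Protein Length fields.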


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.0129237
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAAAAAC AGCAGCGACA CGCGGTACAG CAATGCCCGG TAAGCCACCT TCCCGTCAGA 
ACGCAACCGC ACTGGACAGT CTTTCATGAA CATGCTGATT ATACCGTAAC ATTCGAGGTT
ATCGGAACGG ATATCCTGCA TGGCCAGGCC ATTGCAGATC ATGAGACTGT TCTGGACTAC
ATGGATAACG ATCTTTTTCA GGCCGTATGC GACGATCTCG ATTTAACGGG TCGGGATGTA
TTTGTCATGA TAAATCTCAA TCCCATCCGG AAAATCGCGT TATCGTACAA AAGAGATTTT
GCGAACCTTG TCTATAACTG GGGGCCGCTC TTCTCCGTGC TGATCGTCTA CAATGTTCAT
CCGGAAATTC TGCCGATCAT CGAAGGCTTT GCCTCCATAT GTCCGTGCAA CACCCGAATG
GAAATTGCAG ACTCCTATCG GAATGCCCTC CTCCGAATCT CATCCCACAA AAAACACTCC
GAACTGAGCG CCTTACCGGA AAATACAGCC GGAGACAGAG ATGATTATCA GAAAAAAGCA
TTTCTGTGCA CACTTGCAAG AATGTTATGG ATCGATATGC TCGATTATCC TGTCCCCGTT
CTGAATCGGG ATGACAGTCG CTACGCCTAT TTCCTGGCTC TTGAAGCGTT CAGAAAAGAC
CTGCTGGCAA AAGAACAGGA GCATCAGGAG AAAATCGGCA CGCTGAAGCG CGGTGCAGAC
CATGAGTACG AACAGCATCA AATCCACGTG AACGCCGAAA TCGATCTTCG AAAAAAAAAT
GCGGACGGTT TTTCCAGAAT AATTGAAGAG CTGAAGCAGA TAATTGCAGC TAAAGATGCG
GAACTTGCCG GCATCAGCTC GATAATTGAA GAAAAAACCG GAAAACTAAG CAGAATATGT
GAACAGATAT CCCAGGCGAC CATCAGCCCG GAACAGAAAA ACGCCATGAT CCGAGATTGC
AGAAACATGA TAGCCGCCGA CGCCGCCGAA CAGCAGATTC GGATGGAACT GACCGATACA
GACACCGCAT TCCTTGCCCT GCTCCAGGAA AAACATCCCG GCCTCTCAGA AAAAGAGCTG
CGCATCAGCC TGCTCATCAA ACTGAACTAC GACACCAGAG AAATTGCCCG GATGAGCGGA
CTGACGAAAA GAGGGATGGA AACGACCCGC TACCGCATGC ATAAAAAGCT CGGACTGGAA
AAGCACCACT CCATGAAATA TTATTTCTCA TCTCTGGCTG AAAATCCCCC CGGCAATCGC
CCGTAA
 
Protein sequence
MKKQQRHAVQ QCPVSHLPVR TQPHWTVFHE HADYTVTFEV IGTDILHGQA IADHETVLDY 
MDNDLFQAVC DDLDLTGRDV FVMINLNPIR KIALSYKRDF ANLVYNWGPL FSVLIVYNVH
PEILPIIEGF ASICPCNTRM EIADSYRNAL LRISSHKKHS ELSALPENTA GDRDDYQKKA
FLCTLARMLW IDMLDYPVPV LNRDDSRYAY FLALEAFRKD LLAKEQEHQE KIGTLKRGAD
HEYEQHQIHV NAEIDLRKKN ADGFSRIIEE LKQIIAAKDA ELAGISSIIE EKTGKLSRIC
EQISQATISP EQKNAMIRDC RNMIAADAAE QQIRMELTDT DTAFLALLQE KHPGLSEKEL
RISLLIKLNY DTREIARMSG LTKRGMETTR YRMHKKLGLE KHHSMKYYFS SLAENPPGNR
P
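
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is one minimal way to do that, not the database's own method. It assumes that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in which start codons are permitted) and that translation stops at the first stop codon. The helper names `clean`, `translate`, and `gc_percent` are hypothetical. Only the first line of the gene sequence is used here; pasting the full sequence works the same way.

```python
BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(seq)
    residues = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[pos:pos + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record above.
first_line = "ATGAAAAAAC AGCAGCGACA CGCGGTACAG CAATGCCCGG TAAGCCACCT TCCCGTCAGA"
print(translate(first_line))  # prints "MKKQQRHAVQQCPVSHLPVR"
```

The output matches the first 20 residues of the protein sequence. Calling `gc_percent` on the full gene sequence gives a value that can be compared with the GC content field.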