Gene Clim_1310 details

Gene Information

Locus tag: Clim_1310
Symbol:
ID: 6354503
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1410414
End bp: 1411439
Gene Length: 1026 bp
Protein Length: 341 aa
Translation table: 11
GC content: 50%
IMG OID: 642668925
Product: Sel1 domain protein repeat-containing protein
Protein accession: YP_001943355
Protein GI: 189346826
COG category: [R] General function prediction only
COG ID: [COG0790] FOG: TPR repeat, SEL1 subfamily
TIGRFAM ID:
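
The coordinate and length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes that Start bp and End bp are 1-based, inclusive coordinates and that the gene length includes the terminal stop codon; both assumptions agree with the values in this record but are not stated by it. The function names are hypothetical and exist only for this illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon, which encodes no residue.
    return gene_length // 3 - 1


length = gene_length_bp(1410414, 1411439)
print(length)                     # 1026
print(protein_length_aa(length))  # 341
```

Both results match the Gene Length (1026 bp) and Protein Length (341 aa) fields listed above.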


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.218187
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAAACGTT ATATCATCGG ATTACTTGCA GCTTGTATGC TGATGCAGCC AGCCGGAGCT 
CGTTGTGAAA TTCCCTTGCT GGACAATATT TCTCAGTTGC AGAAAGAGGC TCAACAGGGA
AATGCAGTAG CACAGAACAA GCTTGGACTG CTGTACTACA CTGGTCAGGG GGTCAAGCGG
GACTATGTCG AAGCGCTGCG ATGGTATCGT ATGGCTGCCG AGCAACAACG CGCATGGGCG
CAAGTCAGTC TTGGCGTAAT GTACTACACT GGTCAGGGAG TTAAGCAGGA CCATGCGGAA
GCGGCAACCT GGTTTCGCAA GGCTGCCGAG CAAGGGCTTC CAAAGGGGGA ATACTATCTG
GGAGTAGTAT ATGAAAAAGG TCAGGGAGTA AAGCAGGACC ATGCAGAAGC GGCAACCTGG
TTTCGAAGGG CTGCCGGGCA GGGTCTTGCA GAGGCTCAGA ACAAGCTGGG CCTTATGTAC
TACTCAGGTC AAGGCGTTAA ACAGGACTAT GTGGAAGCGG CAACCTGGTT TCGAAAGGCT
GCAGTTCAGG AATTCGCACT GGCACAAAAC AGCCTTGGCG TCATGTACTA CACTGGTCAG
GGAGTAAAGC AGGACCATGC GGAAGCGGCA ACCTGGTTTC GAAAGGCTGC CGGGCACGGC
CTTTCTGTAG CAGAAAACAA GCTGGGCCTT ATGTACTATA CCGGCCAAAG TGTTAAACAG
GACTATACAG AAGCCGCAGG ATGGTTTCGT AAAGCTGCAG TTAAAGGACT CGCAGAAGCG
CAATTAAATA TCGGGATGCA GTACTACGCA GGACAAGGAG TAAATCAGGA CTATACAGAA
GCTGCAGGTT GGTATCGTAA GGCTGCCGAG CAGGGTCTTG CAGAGGCACA GTATAATTTA
GGAGCAGTTT ACCTGAATGG GAGTGGCATA ACGAAGGACG AACAAAAAGC AAGAGAATGG
TATAAAAAAG CCTGTAACAA TGGATATCGA CCAGCTTGCG ATGACTACCT GAAGATTAAC
GAGTGA
 
Protein sequence
MKRYIIGLLA ACMLMQPAGA RCEIPLLDNI SQLQKEAQQG NAVAQNKLGL LYYTGQGVKR 
DYVEALRWYR MAAEQQRAWA QVSLGVMYYT GQGVKQDHAE AATWFRKAAE QGLPKGEYYL
GVVYEKGQGV KQDHAEAATW FRRAAGQGLA EAQNKLGLMY YSGQGVKQDY VEAATWFRKA
AVQEFALAQN SLGVMYYTGQ GVKQDHAEAA TWFRKAAGHG LSVAENKLGL MYYTGQSVKQ
DYTEAAGWFR KAAVKGLAEA QLNIGMQYYA GQGVNQDYTE AAGWYRKAAE QGLAEAQYNL
GAVYLNGSGI TKDEQKAREW YKKACNNGYR PACDDYLKIN E
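
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The following is a minimal sketch of that relationship using only the Python standard library, not the database's own pipeline. It assumes the codon assignments of NCBI translation table 11, which are the same as those of the standard code (table 11 differs in which codons may serve as start codons; this gene starts with ATG, so that difference does not come into play here). The names `CODON_TABLE` and `translate` are hypothetical, and only the first 30 bases and the final 6 bases of the gene are used as input.

```python
import itertools

# Codons enumerated in NCBI order (T, C, A, G at each position), paired
# with the amino acids of the standard code; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    """
    seq = "".join(dna.split()).upper()
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence, grouped as in the listing.
print(translate("ATGAAACGTT ATATCATCGG ATTACTTGCA"))  # MKRYIIGLLA
# Last 6 bases: one glutamate codon followed by the TGA stop codon.
print(translate("GAGTGA"))  # E
```

The two outputs match the first ten residues and the final residue of the protein sequence listed above.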