Gene Clim_0827 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_0827 
Symbol 
ID6353897 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp908806 
End bp910446 
Gene Length1641 bp 
Protein Length546 aa 
Translation table11 
GC content53% 
IMG OID642668451 
Productprotein of unknown function DUF814 
Protein accessionYP_001942886 
Protein GI189346357 
COG category[K] Transcription 
COG ID[COG1293] Predicted RNA-binding protein homologous to eukaryotic snRNP 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAACGCA ATTACTTCAC GCTTTACCAT ACCGCCATGG AGCTGCATGA ACGGCTTGCC 
GGCGGGTTCG TATTCGAAAT CCATTCGCAG CAGAAAAACG AGCTGACACT GAGCTTCGTC
ACGGCAGCCG GAGATCACCT GCAGGTGGTA ATGGTATCGC GCAAGCCTGA ACTCTGCCTT
TTCACAAGAG AAGGCATGAA CCGACGGAAA CGGCAGAGCG CAAACCTGCT GCATACCATC
TGCGAGCGGG AGGTCACCGG CGTTGCAATG TCACCGTACG ACAGGGAAAT TCTTGTCGGG
CTCTCGGGAG GCGACACGCT TGTATTGCGC CTCTTCAGCG CCTCAGCCAA TGCATTCCTC
GTCTCGGAAG GGATCATTAC CGATGCCTGT ACCTCCAGGA ACGACCTGAT CGGCAAGCCA
TACCGTGAAA AGGACATCCA TTTGGCGCAG GACATCATCC GCGAACTGGA ACTCCTTGCG
CAGGACAAAC AACTGTTTTC ATTGAGAATC GGGCACGCTG CGGGCCCGGA TAATCAGGAC
TCGCTCAACT TCCTCCCGGG CTTCGACCGG ACGATGCTGA GGGCTCTCGT TGAACGTGCA
GGCGGAAATG ACGATCCTGA CAATCTTTTC AATGCTTTCA GGGAAATATT TTATGAACTG
CTCGATCCGC TGCCCCAGAC AGGAAAAACA CCGGAAGGCA AACCACTCTT CAGTATCCTT
CACTCTCCTC TTCCGGAAAG CGAAATCTGC TCATCGATGC TCGAAGGGCT GAGCCGGTAC
AGCTCTGCAA TGTGGGGCTG GCTCCACACC GCGGAGGCAC TTGGTGGTCT GGAAACGACA
TTACGTCAGC AGCTGCGTAA AATCGAAAAA GAGCTGCTGG TGTACGATCC TGAAACGCTT
GCAAAAAATG CCAGCGAGTA TGAAACCAGA GGCCATCTGC TCATGGGAGC ACTCTATCTT
GAACGCACGT CTCCGGACAG TATAATCGTT CCCGACCTCT TCAATCCGGG TAGGCCGGAT
ATCACCATCA CTCTCAAACC GAACCTCTCG CTCAGCGACA ACGCGTCGGA GTATTTCAGA
AAAGCCTCGA AAACAAGAGG CAAATCCTCA GCCCTGGCGC AGCGCAGAAC CGGGCTCGAG
CAGCGAAAAG CGATACTCGA ATCACTTTCG GCAGAACTCG CTTCGCTCGT TTCGCCGAAA
GCTGTAAAAC AGTTCATCGA TGCCAACCGG ACGAAGTTTC GGGGAACCGG CATGCCGGCG
GTTAAAACAG CTTCCGGACC CTCCTCCCGT TTCAGAACCG TTAAACTTTC GCCCTCGGTT
ACGCTCTATA TCGGCAAAAA TGCAAAAAAC AACGAACAGC TTACCTTCGC ATTTGCCAAA
CCCGACGATA TCTGGCTCCA TGCGAGGGGC AGCGCAGGAT CACACTGTGT GCTGAAAGGA
GCAACCATGC AGCATAAAGA GGAGATCCGC AAAGCTGCTG AAATCGCCGC CCGTCATTCG
GCGGCACAAC ACTCCGAACT GGTGCCGGTC ATGTACACAT TTAAAAAATA TGTCCGGCAT
TCAAAAAAAC TCCCGGTCGG ACAGGTTATC GTTGAACGTG AAGAGGTGAT CATGGTTCGA
CCGGCTAAAA ACGATGAATG A
 
Protein sequence
MQRNYFTLYH TAMELHERLA GGFVFEIHSQ QKNELTLSFV TAAGDHLQVV MVSRKPELCL 
FTREGMNRRK RQSANLLHTI CEREVTGVAM SPYDREILVG LSGGDTLVLR LFSASANAFL
VSEGIITDAC TSRNDLIGKP YREKDIHLAQ DIIRELELLA QDKQLFSLRI GHAAGPDNQD
SLNFLPGFDR TMLRALVERA GGNDDPDNLF NAFREIFYEL LDPLPQTGKT PEGKPLFSIL
HSPLPESEIC SSMLEGLSRY SSAMWGWLHT AEALGGLETT LRQQLRKIEK ELLVYDPETL
AKNASEYETR GHLLMGALYL ERTSPDSIIV PDLFNPGRPD ITITLKPNLS LSDNASEYFR
KASKTRGKSS ALAQRRTGLE QRKAILESLS AELASLVSPK AVKQFIDANR TKFRGTGMPA
VKTASGPSSR FRTVKLSPSV TLYIGKNAKN NEQLTFAFAK PDDIWLHARG SAGSHCVLKG
ATMQHKEEIR KAAEIAARHS AAQHSELVPV MYTFKKYVRH SKKLPVGQVI VEREEVIMVR
PAKNDE