Gene Clim_2079 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagClim_2079 
Symbol 
ID6355057 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium limicola DSM 245 
KingdomBacteria 
Replicon accessionNC_010803 
Strand
Start bp2293451 
End bp2294704 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content52% 
IMG OID642669675 
Productdomain of unknown function DUF1745 
Protein accessionYP_001944087 
Protein GI189347558 
COG category[S] Function unknown 
COG ID[COG3287] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.00286991 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGCAA TTTCAAAAAT ACAAGCCGGC TCAGGAGTGA GCGAGCTGAC GGATTCTTAT 
GCGGCAGGCG TTAGCGCCGC TTCCGAAGCT CTTACGGGGA TTTCAGGCAG AACACCTCAG
GTACTGATCG TATTCGGTGC GATGCGTTTC GATCATCGTG AACTGCTCAA AGGTATAATG
TCCGTAGCTG GTGATATTCC TCTGGTTGGG GGAACAACTG CCGGCGAGAT ATCGACCGGC
GGTTTTTCGA CCGGCTCGGT CGTGGTTATG GCGTTAGCTT CGGATTATCT GCATTGTGTT
GAGGGCATAG GCCACGACAT GAGTAGTGAC GAAGCTGCAT GCGCCGTTGA AATGGCCACA
GATATCCTCT CAAAGGGTTC ATTCGACAAG GACGCATCCC TCATGGTTTT TCCCAATGGC
ATGGGAGGAG ACGGCCTTCG TGTTCTTGAC GGTCTTCATT CCGTTTTAGG CCGGGGTTTT
GAAATTTCAG GCGGGTTTCT GGGAGATGAC GAGCGCTTTC AGAGCACCTT TCAGTACTAT
AACGGCAAGG TATACAAAGA TGCCATTGTC GGACTGATGA TCTCCAGAAA GGATTGCATC
AGGACAGGTA TCGGAGTCGG GAGCGGTTTC GAGTCGATCG GTAACAGTTT CGTTTGTACC
GCCTCGGAAG GTAATCTTGT CACGGAGTTT GACCATGTTC GGGCTCTTGA TCTTTACAAG
GATTTTCTTG GTGAAGAACG TTCTGCACGT CTTCCAGGGG TTTGCCTTGA ATATCCGTTC
GGGCTTATCG ATCCGGCGCT GTCCACAGGG GGGGAAGAGC TGTTCCAGTT GCGTTGCGGC
CTTTCGGTCG ATCATGCAAA AGGCACGATC TCTCTTGCTG CTTCGATTCC ATCCGGAAGT
TCAGTAACGC TGACAACTGC CTCAAGGGGA GATATCATAC AAGGGGCGCG TTTTGCTGCC
GAACAGGCAA AGGCATGTCT GTCAGGGGCT ATTCCGCGAC TTGTCGTCAT GTTCAGTTGC
GTGGGGCGCA AGCTTGTTCT CGGAAGACGT ATACAGGAGG AGGCGGCCAC CATTAAGGAG
TGCCTCGGCA GCGATGTTCC GTTGATCGGA TTCTACACGT ATGGAGAAAT AGGCCCGGTC
AACAAAATGA AGCCAGGCTT TGAGACAGCT AAATTTCATA ACGAAACGGT TGTGCTCTGG
GTGATCGGCG AAGAGCATTC CGGATCGCCT GCGACAGCCG TTGTGCGGCC TTGA
 
Protein sequence
MNAISKIQAG SGVSELTDSY AAGVSAASEA LTGISGRTPQ VLIVFGAMRF DHRELLKGIM 
SVAGDIPLVG GTTAGEISTG GFSTGSVVVM ALASDYLHCV EGIGHDMSSD EAACAVEMAT
DILSKGSFDK DASLMVFPNG MGGDGLRVLD GLHSVLGRGF EISGGFLGDD ERFQSTFQYY
NGKVYKDAIV GLMISRKDCI RTGIGVGSGF ESIGNSFVCT ASEGNLVTEF DHVRALDLYK
DFLGEERSAR LPGVCLEYPF GLIDPALSTG GEELFQLRCG LSVDHAKGTI SLAASIPSGS
SVTLTTASRG DIIQGARFAA EQAKACLSGA IPRLVVMFSC VGRKLVLGRR IQEEAATIKE
CLGSDVPLIG FYTYGEIGPV NKMKPGFETA KFHNETVVLW VIGEEHSGSP ATAVVRP