Gene Clim_1572 details


Gene Information

Locus tag: Clim_1572
Symbol:
ID: 6354220
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 1691791
End bp: 1693122
Gene Length: 1332 bp
Protein Length: 443 aa
Translation table: 11
GC content: 53%
IMG OID: 642669176
Product: hypothetical protein
Protein accession: YP_001943598
Protein GI: 189347069
COG category:
COG ID:
TIGRFAM ID:
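
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative only.

```python
# Values copied from the Gene Information record.
start_bp = 1691791
end_bp = 1693122
gene_length_bp = 1332
protein_length_aa = 443

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
expected_aa = span_bp // 3 - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codon count matches protein length:", expected_aa == protein_length_aa)
```

Under these assumptions both checks agree with the record: 1332 bp is 444 codons, one of which is the stop codon.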


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000000183741
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGAAAGC CTGAACGCCG GGATTTTTAT TCTTTTGCAC GTATTGTATC GCTGTTTCTC 
GGGATTTGCG GGCTCAGTGT GCTGAGCCTC TCTCTCTACT GGAGGGTTGT GGTGCCTGCT
GAACCTCATA TGTCAATGCA TTTCAGTCCT GTTTACAAAG AGTTCGTTCT CAGCGGAAGA
TGGAGCGGTT CAGTTCCGGA GCCGAATGAC GGACAGCGAT ACTGGAGGGA AGATGGATAT
CCTCTCGGCA GGGAGCAGTA TCGTGCAGCG CTACCGTTTC TGTATGTCAG GGACCTTGTC
AAGTGGGATG CGTTTCCTGA AACGATCGCA GGTTTGGCAG TTTCTCCGTT CGACGCTGAA
CAGTCTTGGC AGTTCATGCG GCTGTTTTCC GAAGACTGGA ATGCTCCTCC ACCGATGCTG
CATATGTTCA TTGAGTCGAA ACCTCATGGC GCAAGATTGC AGAAACCGGA TGATTTTTTC
AGGGTTTCGT CATCCGGAAA CGGCATTGAA TTCCTGACGC CCCGTGAGGG TAAGGTGGAC
AGTCTTAAAA GCAGCAGCTT TACCCGAGCC CTCCGCGCGG CAGGTTTTGT GTTTCCCGTA
ATGGAGCTGG GTGGAAATCC TGATGTGAGA AAGCGGTATG ATGCGGGATA TTTTATTGCT
GATTCGAAAG GGTTCTTGTT TCAGATGCAG ATGGTTGACG GACAACCATC ATGCCGGCGC
CTGCAGCCGC AGATCAGCGG CAAGATAAGG TACATAGCTG TCAACGAGCA TCATCGCGAA
GAGTTTTTCG GTTTTGTCGC AACCGATGAC GCCCTGTATG CCATCATGCA GAAGGAGAGA
CGGCTCAAGC AGCTGCCGAT CGGGAAATTC GATGCCGATG CGCTCAGACT TGCGATATGG
TCCGATATTC TCTACACCTC CGTGTTCATC GAGAGTCCCG GTATGCCGGG GTCGGGAATT
GAAGGCATTG CCATGACCCC CGATTTCAAA GTTGTTCGTC GATACGTCCA GGCGCAGGAT
TCGGACTATG CTGGTTCAAT GCGCCGGCTC GATACCATCG CCTCTTTTCT TTTCCCGCTG
CAGATCGTCA GTGGAATACC GGGGTCTTCC TTCCGAGACA TGCGTGCAGG GCCAGGAGGA
GATCTGTCGG CAGTGCTTGC CGGCAGCGTT TTCGCGCTTG TCGCTTTCAT ACGTGTTACC
CGGTCAGGAT TCCGAAAGAG TGTGAGGCCG TGGAGCGATT ATGTGTTTGT CGCCGTTTTC
GGTTTTGTTG CCATAGCGAT GATACTGATT GAAGACGCCG ACCAACATGT GCGGGTGCTG
CATAACAGCT GA
 
Protein sequence
MGKPERRDFY SFARIVSLFL GICGLSVLSL SLYWRVVVPA EPHMSMHFSP VYKEFVLSGR 
WSGSVPEPND GQRYWREDGY PLGREQYRAA LPFLYVRDLV KWDAFPETIA GLAVSPFDAE
QSWQFMRLFS EDWNAPPPML HMFIESKPHG ARLQKPDDFF RVSSSGNGIE FLTPREGKVD
SLKSSSFTRA LRAAGFVFPV MELGGNPDVR KRYDAGYFIA DSKGFLFQMQ MVDGQPSCRR
LQPQISGKIR YIAVNEHHRE EFFGFVATDD ALYAIMQKER RLKQLPIGKF DADALRLAIW
SDILYTSVFI ESPGMPGSGI EGIAMTPDFK VVRRYVQAQD SDYAGSMRRL DTIASFLFPL
QIVSGIPGSS FRDMRAGPGG DLSAVLAGSV FALVAFIRVT RSGFRKSVRP WSDYVFVAVF
GFVAIAMILI EDADQHVRVL HNS
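
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It assumes the codon-to-amino-acid assignments of NCBI translation table 11, which match the standard code for elongation (the two tables differ only in permitted start codons). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of any database tool.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G); '*' marks a stop codon.
# Assumption: table 11 shares these assignments with the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in the record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 12 bases of the gene sequence in the record.
print(translate("ATGGGAAAGC CTGAACGCCG GGATTTTTAT"))
print(translate("CATAACAGCT GA"))
```

The first call yields the opening ten residues of the protein sequence (MGKPERRDFY) and the second yields its final three (HNS), with the terminal TGA read as a stop codon.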