Gene Clim_2181 details

Gene Information

Locus tag: Clim_2181
Symbol:
ID: 6355975
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium limicola DSM 245
Kingdom: Bacteria
Replicon accession: NC_010803
Strand:
Start bp: 2419916
End bp: 2421046
Gene Length: 1131 bp
Protein Length: 376 aa
Translation table: 11
GC content: 49%
IMG OID: 642669772
Product: protein of unknown function UPF0118
Protein accession: YP_001944184
Protein GI: 189347655
COG category: [R] General function prediction only
COG ID: [COG0628] Predicted permease
TIGRFAM ID:
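
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the variable names are illustrative only and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 2419916
end_bp = 2421046

# Assumption: coordinates are 1-based and inclusive on the replicon.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1131
print(protein_length)  # 376
```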


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.00612132
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGCTGCAT ATGAAAAGCA AACGATCAAC AAAACAGGGA TGACGATGAA CAGGTTTACT 
GCCAGCAAGG TTGTGCTTCT CTTGATTGTC TTTGTGATCT CGGCGCTTTT TTTTGCCATG
ATCCGGTATT TTTTCATGGC GATTTTTCTT GCGGCAATTT TTTCGGCATT GTCCATGCCG
ATTTACAGCC GGATCGAGCG ATTTGTAAGA GGACGGAAAA ATCTGAGCTC GGCATTGACC
ATGATTTCGC TTTTTATCAT GGTTTTTCTT CCGTTTACGG CAGTTATGGG CATTGTCGCC
GTTCAGGCCG TCAACATCAG CCGGGCGGCT GTGCCGTGGA TTCAGGCCCA ACTCAAGGAA
CCGGCCACGT ATAACACCAT GCTGCAGTCG TTTCCCTATT ACCGTGAACT GGAACTGTAT
CGCGAAGAGA TTCTGCAGAA AGCGGCCGAA TTGGCCGGGA CTGCCGGGAC TTTTCTCTTT
AACAGTCTTT CGTCGATTAC CGTGACAGCG ATGAACGAGC TTTTTCTGAT GTTCATTTTT
CTCTATACCA TGTTCTTTTT TCTCAAGGAT GGAAGGCTTC TGCTTGAAAA AATCATGTAC
TATGTTCCTT TGGATGAGTC GGATCAGTAT CGTCTGCTTG ACCGTTTTCT TTCGGTAACC
CGGGCAACGC TCAAGGGAAC CATGGTTGTC GGTCTTATTC AGGGATCCGT TGCCGGTCTG
GCCCTGCATC TTGCCGGCAT CGAAAGCGCT CTGTTCTGGG GAACGATCAT GAGCGTGCTT
TCGGTCGTTC CGGTGCTCGG TCCTCCGCTT GTCTGGCTGC CGGCGGCAAT CTATCTTGCT
GTAACAGGTC ACTATACCGA AGCGGCAGCT GTTTTTCTTT TCTGCAGCAT TATAGTCAGT
CAGCTCGACA ATGTGCTTCG TCCCATTCTC ATCGGTCGCG ACACGCAGAT GCATGAGCTT
ATGATCTTTT TCGGTACCCT CGGTGGTCTG GGGTTGTTCG GCCTTTTCGG TTTTATTATC
GGCCCGATTG TAGCCGCTCT GTTCATTACC GTTTGGGAAA TCTACGGTGA GACATTCAGC
GATTACCTGC AAGAGGTGAA GCGGAAGAGT GAGCGTCGTA TCGATAGTTG A
 
Protein sequence
MAAYEKQTIN KTGMTMNRFT ASKVVLLLIV FVISALFFAM IRYFFMAIFL AAIFSALSMP 
IYSRIERFVR GRKNLSSALT MISLFIMVFL PFTAVMGIVA VQAVNISRAA VPWIQAQLKE
PATYNTMLQS FPYYRELELY REEILQKAAE LAGTAGTFLF NSLSSITVTA MNELFLMFIF
LYTMFFFLKD GRLLLEKIMY YVPLDESDQY RLLDRFLSVT RATLKGTMVV GLIQGSVAGL
ALHLAGIESA LFWGTIMSVL SVVPVLGPPL VWLPAAIYLA VTGHYTEAAA VFLFCSIIVS
QLDNVLRPIL IGRDTQMHEL MIFFGTLGGL GLFGLFGFII GPIVAALFIT VWEIYGETFS
DYLQEVKRKS ERRIDS
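
The sequences above are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The following is a minimal sketch of how the GC content and the translation could be recomputed from the gene sequence. The helper names `clean`, `gc_content`, and `translate` are hypothetical. The codon assignments are those of the standard genetic code, which translation table 11 shares (table 11 differs only in the set of permitted start codons).

```python
# Codon table in NCBI order (bases T, C, A, G). Table 11 uses the same
# codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()

def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)

def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)

# First three blocks of the gene sequence as printed in the record.
print(translate("ATGGCTGCAT ATGAAAAGCA AACGATCAAC"))  # MAAYEKQTIN
```

Passing the complete gene sequence to `gc_content` and `translate` should allow the GC content and protein sequence fields to be checked against the values listed in the record.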