Gene Plut_2055 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_2055 
Symbol 
ID3746197 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp2284867 
End bp2286003 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content61% 
IMG OID637770086 
Productglycosy hydrolase family protein 
Protein accessionYP_375940 
Protein GI78187897 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG1472] Beta-glucosidase-related glycosidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTCAGA GGGTCCCCGT CTTTCTTGCG CTGCTGCTCT TCGCCCTCCT CCTCCAGCCA 
CCGCCCATCG CAGAAGCGGG AACGAAGCCG GACAGCCTAT CCATTAAAAT CGGCCAGATG
CTGATGGTGG GCTTCAGGGG CACCACCATC GGCGATGCCC CTGACGTACG GCGCGCCATC
GACCGCCAGC GCATCGGCGG CGTGGTGCTC TTCGACTATG ACGTCCCGTC CCGGACCCCG
CTTCGCAACA TAACCGGCCC GGAACAGCTT CAGCGGCTGA ACGGCGAACT GCAGGAGCGT
TCCCCCGTCC CGCTCTTCAT TTCGATCGAC CAGGAAGGCG GCATGGTCAG CAGGCTCAAG
CCGGCAAGGG GGTTCCCCCG GAGCCCGAGC GCCCGGAGCC TCGGACTGCT CAGGAACCCC
GACAGCACCC TCGCTGCAGC GGAGGTGACT GCCCGGACAC TGCAGTCGAT GGGAGTCAAC
ATGAACCTTG CGCCGGTGGT CGACCTTGAT ACCAACCCGC AGAATCCCGT CATCGGCCGT
ATAGAGCGGA GCTACTCGCC TGACCCCGAC ATCGTCTCGT CTCAGGCCGC CATCGTCACA
ACGACCTTTC TCCGCGAGGG GATCATCCCT GTCCTGAAAC ATTTTCCCGG CCACGGCAGC
TCGACCTCGG ACAGCCATCT GGGCTTCACC GACGTAACGG AAAGCTGGAG CGAGATTGAA
CTTGAACCAT ACCGCAGCCT CTTGCTTGAT GGATATCAGG GTGCCATCAT GACCGCCCAC
GTCTTCAACG CCCGCCTCGA TCCCCGCTAT CCGGCGACCC TTTCAAAGGC GACCATCAGC
GGCCTACTAC GAGAAAAGCT CGGGTTCCGT GGGGTGGTGC TTACCGATGA CATGCAGATG
GGCGCCATTG CCCAGAACTT CGGCTTTGAA GAAGCGGTCC GCCTGTCGAT TGAAGCCGGT
GCTGACATTC TTGTGTTTGC CAACAATACG GCCGTCTACG ACCCAAAAAT CGCAGAAAAG
GCATCAGGCA TCATCCGCAG GATGGTGGAT GAGGGCATAA TTTCTCCCCT TCGCATCGAG
GAGTCGTACC GGAGGATCAT GACACTCAAA GAGACTGTAA CCCACCCTGC CAGATGA
 
Protein sequence
MSQRVPVFLA LLLFALLLQP PPIAEAGTKP DSLSIKIGQM LMVGFRGTTI GDAPDVRRAI 
DRQRIGGVVL FDYDVPSRTP LRNITGPEQL QRLNGELQER SPVPLFISID QEGGMVSRLK
PARGFPRSPS ARSLGLLRNP DSTLAAAEVT ARTLQSMGVN MNLAPVVDLD TNPQNPVIGR
IERSYSPDPD IVSSQAAIVT TTFLREGIIP VLKHFPGHGS STSDSHLGFT DVTESWSEIE
LEPYRSLLLD GYQGAIMTAH VFNARLDPRY PATLSKATIS GLLREKLGFR GVVLTDDMQM
GAIAQNFGFE EAVRLSIEAG ADILVFANNT AVYDPKIAEK ASGIIRRMVD EGIISPLRIE
ESYRRIMTLK ETVTHPAR