Gene Plut_2029 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPlut_2029 
Symbol 
ID3746076 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium luteolum DSM 273 
KingdomBacteria 
Replicon accessionNC_007512 
Strand
Start bp2256153 
End bp2259455 
Gene Length3303 bp 
Protein Length1100 aa 
Translation table11 
GC content60% 
IMG OID637770060 
Productalpha amylase domain-containing protein 
Protein accessionYP_375914 
Protein GI78187871 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG0366] Glycosidases
[COG3281] Uncharacterized protein, probably involved in trehalose biosynthesis 
TIGRFAM ID[TIGR02456] trehalose synthase
[TIGR02457] trehalose synthase-fused probable maltokinase 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.269645 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTATCAAC CTGAACCGCT CTGGTACAAA GACGCCATCA TCTATGAGGC GCATGTCAAG 
ACGTTCTACG ACAGCAACAA TGACGGCATC GGCGATTTTC AGGGGCTCCG TCGCAAGCTT
CCCCATCTGG AGAGCCTCGG TGTAACCGCC ATATGGCTGC TTCCCTTCTA CCCCTCTCCG
CTGCGCGATG ACGGATACGA CATTGCCGAT TACATGACGG TCAACCCCGA CTACGGGACG
CTGGACGATT TCAGGGAGTT CCTCGAAGAG GCCCACAGCC GCGGGCTGAA GGTCATCACC
GAGCTGGTGG TCAACCATAC CTCAGACCAG CACGCATGGT TCCAGCAGGC ACGCAAGGCG
CCGGCAGGTT CCCCGGAGCG GAACTTCTAT GTGTGGAGCG ACGATCCCAA CAAGTATTCC
GAAACCCGCA TCATCTTTCA GGACTTCGAG GCCTCCAACT GGACATGGGA TCCTGTTGCC
GGCCAGTATT ACTGGCACCG GTTCTTCCAC CACCAGCCGG ACTTGAACTT CGAGAACCCC
GATGTCCAGC AGGCGCTCTT CGACGTGCTC GATTTCTGGC TCGGCATGGG TGTCGACGGC
CTCCGGCTCG ACGCCGTCCC CTATCTCTAC GAGGCTGAGG GCACCAACTG CGAAAACCTT
CCCGAGACCT ACGAGTTCCT GAAGAAGCTC CGCCGCCATG TCGATGAGCA TTTCCCGAAC
CGCATGCTGC TTGCCGAGGC CAACCAGTGG CCCGAGGATT CCGCAGCCTA CTTCGGCAAT
GGCGACCAGT GCCACATGAA CTTCCACTTC CCCTTGATGC CGAGGATGTA CATGGCCCTT
GCCACCGAGG ACCGGTTCCC CATCATCGAC ATTCTCGAGC AGACCCCGGA AATCCCGGAG
GGCTGCCAGT GGGCATCGTT CCTGCGCAAC CATGACGAGC TGACCCTCGA AATGGTGACC
GACGAGGAGC GCGACTACAT GCGCCGCGTC TACGCCAACG ACCCCAAGGC GCGCATCAAC
CTCGGCATCC GCCGCCGCCT GGCTCCGCTC ATGACCAACG ATCGCCGGCG CATCGAGCTC
ATGAACATCA TGCTCCTTTC GCTTCCCGGC ACCCCGGTGC TCTACTACGG CGACGAAATC
GGCATGGGCG ACAACTACTA CCTCGGTGAT CGCGACGGTG TGCGCACCCC CATGCAGTGG
AACAGCGACC GCAATGCCGG GTTCTCCCGC GCCAACCCCC AGCAGCTGCT GCTGCCTGTC
ATCATCGATC CCGAGTACCA CTACGAGGCG GTGAACGTCG AGGTGCAGGA GAGCAACGTC
AACTCGCTGC TCTGGTGGAC CCGCCACGCC ATCGCCACCG CGCGTCGCTA CAAGTCGCTC
AGCCGCGGCT CGATAGAGTT CCTGCAGGTC AGCAATCCCA AGGTGCTCAT CTTCATCCGC
CGTTACGAAG ACGAGACCAT CCTGTCGGTC ATAAACCTTT CGCGCAACGC ACAGGCGGTC
TCCGTCGACC TCTCCGCCTT CGAAGGCTAC ACTCCCGAAG AGGTGTTCAG CATGAACCGC
TTCCCGAAAA TCCGTCAGGC GCCCTATATG CTCTCGCTCG GCTCCTATGG GTATTTCTGG
CTGAAGATGG TGCCGGATAC CGCCGATGTA TCCTGCCGTC CCGGTCTGGA AACGGCGTTC
GCCGAAGTGT ATAACTGGCC TGCGCTCTTT GCCGGCAGAA GCCGAGAGAA GCTTGAAAAC
TCGGTCCTGC CCTGCTACTA CCAGGCTATG CGCTGGTTCG GCGGCAAGGC GCGCAATGTC
CTTCGCATTT CGATCGTCGA CACCATTCCG GTTGAGGGTA TGGAGCATGC CAAGCTTCTC
GTCACCGAAG TCCGCTACTC CAGCGGAGAG AACGAACTCT ATCAGCTGCC TGTCAGCTTC
GTTCCACTGG AGGAGGTATC GCCTTCCGAC GACAACTTCT ACCGGAGGGT CATCGGCCGC
GCCGTTGTCG GGGAAGTGGA CGGGTACCTG TGCGACGCTA CCTACGAGAA CGCGTTCCTC
AGCCGCCTCT TCAGCATCAT GGTCGGCGGT GAGCAGTGGA AAGGAAAGGC GGGTATTGTT
TCCGGCATGA AAGGATCCGC TCTCGAAGTG TTCCGCGACG TGACGGACGG GTCGCAGCCC
GAACCCTTCC TCATGGGGGT CGAGCAGACC AACACCTCAA TCCGCTTCCA CGATTCGCTC
TGTTTGAAGC TCTACCGCCG GATCGAGAAG GGCGTTTCTC CCGAAATCGA GATGTGCGGC
GCCCTGACGG AAAAAACCGG TTTTGCAAGC CTCCCGAAAT ACCTCGGTTC GCTCAACTAC
GATCAGGGCC GCAATGCCGG TTACTCCATC GGCATCCTGC AGAATTTCGT GGAGAATGAA
GGTGATGCAT GGCAGCTCTC CCTCGGCCAG GTTGCCCGTT ACTTCGGAGA CGTACAGGCC
AAGGTTGGCG CCGGCATCGA GCTTCCGTCC ATGCAGCAGC TCAGCGGCAG CCCCGTCGAG
GTCCCTGAGC TCATGCATGA GCTTATCGGT GGAGCATATC TCGGCATGAT CGAGAAGCTG
GCCGAGCGTA CCGCTGAAAT GCACCTCTCG CTGGCCTCGC TTGACGCCGA CCCTGCGTTC
GCCCCGGAGC CCTTCACCAC CCTCTACCAG CGCTCAATCT ATCAGGCGAT GTGCGAGCAG
GTAAAGCGTG CCATGATCCT CATCGGGGAG CTTATGCCTT CGATGGAGCC GGAACAGCGT
GAGCTCTGCT CGCTGCTCGT CAGGAACCAG AAGGACATCC TGCAGCAGTT CGAGCCGGTC
CGCCAGGAGA AGATCGATAC CCTGAAAATC AGGATCCACG GCGACTATCA CCTCGGCCAG
GTGCTCTTCA CCGGCAAGGA CTTCGTCATC ATCGACTTCG AGGGCGAGCC GGCCCGTCCG
ATTTCCGAAC GCAAGATCAA GCGCTCGGTC TTCCGCGATA TCTCCGGCAT GCTCCGTTCA
TTCGACTATG CGGCCTTCCA TGTGCTCCAC CTCAACGAAT CGGTTGTTCG CGCTGAAGAC
CGTCACCAGA TGGAACCCTG GGCCGACCGC TGGAGCAATG CCGTCGGGCA GCACTTCCTC
GACTCCTACT TCAAGAGCAC CGAGGGCAGT GCCATCGTCC CTGAGGATCC CCGTCAGCGG
GAACACCTGA TGAACGCCTA CCTTATGAAC AAGGCGGTCT ACGAGCTCAA CTACGAGCTC
AACAACCGCC CGCAGTGGGT TGGTATCCCG ATCAGGGGCA TTCTGAAGAT GCTCGATATG
TAA
 
Protein sequence
MYQPEPLWYK DAIIYEAHVK TFYDSNNDGI GDFQGLRRKL PHLESLGVTA IWLLPFYPSP 
LRDDGYDIAD YMTVNPDYGT LDDFREFLEE AHSRGLKVIT ELVVNHTSDQ HAWFQQARKA
PAGSPERNFY VWSDDPNKYS ETRIIFQDFE ASNWTWDPVA GQYYWHRFFH HQPDLNFENP
DVQQALFDVL DFWLGMGVDG LRLDAVPYLY EAEGTNCENL PETYEFLKKL RRHVDEHFPN
RMLLAEANQW PEDSAAYFGN GDQCHMNFHF PLMPRMYMAL ATEDRFPIID ILEQTPEIPE
GCQWASFLRN HDELTLEMVT DEERDYMRRV YANDPKARIN LGIRRRLAPL MTNDRRRIEL
MNIMLLSLPG TPVLYYGDEI GMGDNYYLGD RDGVRTPMQW NSDRNAGFSR ANPQQLLLPV
IIDPEYHYEA VNVEVQESNV NSLLWWTRHA IATARRYKSL SRGSIEFLQV SNPKVLIFIR
RYEDETILSV INLSRNAQAV SVDLSAFEGY TPEEVFSMNR FPKIRQAPYM LSLGSYGYFW
LKMVPDTADV SCRPGLETAF AEVYNWPALF AGRSREKLEN SVLPCYYQAM RWFGGKARNV
LRISIVDTIP VEGMEHAKLL VTEVRYSSGE NELYQLPVSF VPLEEVSPSD DNFYRRVIGR
AVVGEVDGYL CDATYENAFL SRLFSIMVGG EQWKGKAGIV SGMKGSALEV FRDVTDGSQP
EPFLMGVEQT NTSIRFHDSL CLKLYRRIEK GVSPEIEMCG ALTEKTGFAS LPKYLGSLNY
DQGRNAGYSI GILQNFVENE GDAWQLSLGQ VARYFGDVQA KVGAGIELPS MQQLSGSPVE
VPELMHELIG GAYLGMIEKL AERTAEMHLS LASLDADPAF APEPFTTLYQ RSIYQAMCEQ
VKRAMILIGE LMPSMEPEQR ELCSLLVRNQ KDILQQFEPV RQEKIDTLKI RIHGDYHLGQ
VLFTGKDFVI IDFEGEPARP ISERKIKRSV FRDISGMLRS FDYAAFHVLH LNESVVRAED
RHQMEPWADR WSNAVGQHFL DSYFKSTEGS AIVPEDPRQR EHLMNAYLMN KAVYELNYEL
NNRPQWVGIP IRGILKMLDM