Gene Tneu_0235 details

Gene Information

Locus tag: Tneu_0235
Symbol:
ID: 6165917
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermoproteus neutrophilus V24Sta
Kingdom: Archaea
Replicon accession: NC_010525
Strand:
Start bp: 207645
End bp: 209084
Gene Length: 1440 bp
Protein Length: 479 aa
Translation table: 11
GC content: 65%
IMG OID: 641667398
Product: 4-alpha-glucanotransferase
Protein accession: YP_001793634
Protein GI: 171184715
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG1640] 4-alpha-glucanotransferase
TIGRFAM ID: [TIGR00217] 4-alpha-glucanotransferase
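
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the function names are hypothetical and are not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 207645
END_BP = 209084
GENE_LENGTH_BP = 1440
PROTEIN_LENGTH_AA = 479


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print("span length (bp):", span_length(START_BP, END_BP))
    print("expected protein length (aa):", expected_protein_length(GENE_LENGTH_BP))
```

Under these assumptions the span works out to the listed gene length, and one third of that length minus the stop codon gives the listed protein length.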


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value: 0.788709
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value: 0.23673
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCTTCGGG GCTTTGGAGT ACTTCTCCAC ATATCTAGCC TCCCCGGGGG TTGCCTAGTC 
GGCGACCTGG GGCCCTCCGC CTATAGATTC GCCGACTTCC TATCCGAAGC CGAGGCCACC
TACTGGCAGA TCCTGCCGCT GAGCCACACG CTACCTGAAT ACGACGACTC CCCCTACAGC
GCAGCCTCGC TGCTGGCTGG AAACCCGGCC CTCGTCAGCC TGGAGAAGAT GGCCCAGCTG
GGGTTGGCGA AGAGGGCGCC GCCCAGCTGT CCGCCCGCCG AGAGGGCGCG TTTCGCAGAG
GCTTGGGAGC TCAAGAGGCG GTATCTTGAG GAGGCCTTCG AGGGAAGGCT GGGCTGGCGG
GATTACGAGG AGTTCGCCGC CCGAAATAGC TGGTGGCTGG AGCCCTACGG TAGATACATG
GCGCTAAGGG AGGCCTTCGG GGGGCCGTGG ACCGCCTGGC CCGCCTGGGC GAGGAGACCC
AACGCCGATC TGCCGCCACG CCTAGAAAGG AGGGCGGATT TCTACAGATA CGTCCAGTTC
CACTTCTGGC TACAGTGGGA GGAGCTGAAG AGATACGTCA ACAGCCTCGG CGTATTTATC
ATAGGCGACC TCCCCATATA CCCGGCGTTA GACAGCGCCG ACGTGTGGGA GGGGCAGAGG
TACTTCAAGC TGGCGCCCGA CGGCGCCCCC CTCTACGTCT CCGGGGTTCC GCCTGACTAC
TATTCACCCA CCGGACAACT ATGGGGGACG CCGGTCTACA ACTGGGCGGA GCTGAGGAGA
GACCGCTACG TCTGGTGGAC CCGGCGCCTT ACGAGGCTAC TCTCCATATT CGACTACATA
CGCCTCGACC ACTTCAGAGG ATATGCGGCG TATTGGGAGG TGCCCTACGG GGAGCCCACG
GCCGTAAGGG GGAGGTGGGC GCCGGGGCCC GGCGAGGAGC TTTTCAGAGC CGCCGAAGAT
GCCCTCCCCA GGCTCATCGC GGAGGACCTG GGCTTCATCA CCCCAGACGT TGTGGAGCTC
AGGTATAGGC TGGGCATACC CGGCATGCGC GTGCTCCAGT TCGCATGGGA CGGCAACCCC
GCCAACGAGC ACAAGCCGCA CAACTACGAG AGGAACCTTG TGGCGTACAC CGGAACACAC
GACAACAACA CCACCCTAGG CTGGTGGAGG GAGGAGACAA CGCCGAGGTC GAGGCGCGAG
GCCCTCGCCT ACATGGGCGG CTGCAGAGGC GGTGTGAGCT GGTGCTTCAT ACGCCTCCTC
TTCTCCACCG TGGCCGACGT GGCCGTAGTC CCGATGCAGG ACGCCCTCGG GCTAGGTAGT
GAGGCTCGGA TGAACAAGCC CGGCACCGCG AGGGGCAACT GGAAGTGGAG GATGGCCGGA
GACCCGCCCC GGGCTGTGGC GGCGCGGCTC AGGCGCCTTG CAAGGATCTA CGGGCGCTGA
 
Protein sequence
MLRGFGVLLH ISSLPGGCLV GDLGPSAYRF ADFLSEAEAT YWQILPLSHT LPEYDDSPYS 
AASLLAGNPA LVSLEKMAQL GLAKRAPPSC PPAERARFAE AWELKRRYLE EAFEGRLGWR
DYEEFAARNS WWLEPYGRYM ALREAFGGPW TAWPAWARRP NADLPPRLER RADFYRYVQF
HFWLQWEELK RYVNSLGVFI IGDLPIYPAL DSADVWEGQR YFKLAPDGAP LYVSGVPPDY
YSPTGQLWGT PVYNWAELRR DRYVWWTRRL TRLLSIFDYI RLDHFRGYAA YWEVPYGEPT
AVRGRWAPGP GEELFRAAED ALPRLIAEDL GFITPDVVEL RYRLGIPGMR VLQFAWDGNP
ANEHKPHNYE RNLVAYTGTH DNNTTLGWWR EETTPRSRRE ALAYMGGCRG GVSWCFIRLL
FSTVADVAVV PMQDALGLGS EARMNKPGTA RGNWKWRMAG DPPRAVAARL RRLARIYGR
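
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation sketch. It assumes translation table 11, whose amino-acid assignments match the standard code, and it treats an initiating GTG, ATG, or TTG as methionine. That start-codon set is a simplification chosen for illustration, and the helper names are hypothetical. A GC-fraction helper is included for comparison with the GC content field.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Simplified set of initiator codons read as methionine (an assumption).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the 10-base block layout."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a CDS, stopping at the first in-frame stop codon."""
    dna = clean(dna)
    if len(dna) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residues.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First line of the gene sequence from this record.
    first_line = "GTGCTTCGGG GCTTTGGAGT ACTTCTCCAC ATATCTAGCC TCCCCGGGGG TTGCCTAGTC"
    print(translate(first_line))  # MLRGFGVLLHISSLPGGCLV
```

The first 60 bases translate to the first 20 residues of the listed protein, with the leading GTG read as methionine. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison for the whole record.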