Gene Tpen_1451 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTpen_1451 
Symbol 
ID4601160 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermofilum pendens Hrk 5 
KingdomArchaea 
Replicon accessionNC_008698 
Strand
Start bp1401556 
End bp1402890 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content59% 
IMG OID639774226 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_920851 
Protein GI119720356 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3833] ABC-type maltose transport systems, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0294771 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGGGCGC AGCAGAAGGC TAGGAGGAGC GACTTCTTCA AAAGCGCGGT GCTGACGCTG 
CTCGCGCTCG TCGTTATGGG CGTACTGCTC TTCCCCGTTT ACTACATGTT CATGGTCTCG
CTTAAACCCG TTGGAACCCT AGCCACTACG AGCCTCGAGG TAATTCCGAG CAAGGTTACG
CTCGACAACT ACCTAGAGAT ACTCGTGGGT CACTACGAGG CGACGCTGGA CGTGAAGAGC
TTCGCGCTCC GCGCCCAGAA CGCCACGATA TCGGATGCCC TGAACCGCTA CGAGGTCGAC
CTGTATGACG GAGTCGTGGC CGGCGACTAC CCCGTGAAGT TTACTCTAAG CAACGCTAAG
ATTCTGGAGA GGAGGGGAGG ACAGGAGAGA GGTGAGAGGG ACGCCACGAT AATAGTCGGC
GGGGACTACC TGAAGGTGGG CGCGGACTCC GCGGAGACGA TAACGGCGGC TAGAAGGCTG
ACGGTTGAAG CGAGGAAAAT CGTCGTGAAG GTCAGTGGTC CCGGCGGAGC GCCCCTCGAC
CTCTCGAAGT TCAGAGAGGT GGCTCCGGGC GTCTACGAGG CGGAGAACGC CAGGCTGTAC
CTGGAGGACG GGGGGAGGAT CTCGGCGGAG AAGTGCACCG TTGAGACTAC CAGTTTCAGC
TACATAAGGC TGGCGAAGGT CGGGGGAGAG ATATGGGGCT ACATGAGTAG GAGCCTGATA
ATCGCAAGCC TAACGGTGGT TCTAACCCTG CTCTTCGTGG TGCCGTCCGC CTACGCGTTT
TCGAGGCTCA AGTTCTTCGG GAAGGGGCAC ATACTCTACT CTTACCTCAT GTTCACGCAG
GTAGCGGGAG GACTGGGGAT AGCCGGGCTC GTAGCCCTGT ACGGCATGCT CGTTAGGCTC
AACCTGGTGA ACAACATCTT CGTGCTACCG GTGATCTACG CGGCGGGGAG CGTCCCGTTC
AACACGTGGC TCCTCAAGGG GTACCTGGAC TCCATAAGCC CGGATTTCGA CGAAGCCGCC
CTCGTAGACG GGGCGAGCTA CGCGCAGATC ATAGGGCAGG TCCTAGTGCC GATGGCGCTA
CCGGGTATAG CGACGGTCGC AATCTTCTCC TTCATCGGGG GGTGGACGGA GCTCATACTT
GCGAACCTGC TGCTCAACCA GGAGAACCAC CCCTTGACCG TCTACATCTA CGTGTTGCTC
ACCAACCTCA GGAACGTGTC CTGGAACCAG TTCGCGGCCG CCGCTCTGAT CTTCGCCCTC
CCCGTCGTAG TGATGTTCCT GCTGGCCCAG AACTACGTCA GAAGCGGGTT GACGATGGGA
GGACTAAAAG AGTAA
 
Protein sequence
MRAQQKARRS DFFKSAVLTL LALVVMGVLL FPVYYMFMVS LKPVGTLATT SLEVIPSKVT 
LDNYLEILVG HYEATLDVKS FALRAQNATI SDALNRYEVD LYDGVVAGDY PVKFTLSNAK
ILERRGGQER GERDATIIVG GDYLKVGADS AETITAARRL TVEARKIVVK VSGPGGAPLD
LSKFREVAPG VYEAENARLY LEDGGRISAE KCTVETTSFS YIRLAKVGGE IWGYMSRSLI
IASLTVVLTL LFVVPSAYAF SRLKFFGKGH ILYSYLMFTQ VAGGLGIAGL VALYGMLVRL
NLVNNIFVLP VIYAAGSVPF NTWLLKGYLD SISPDFDEAA LVDGASYAQI IGQVLVPMAL
PGIATVAIFS FIGGWTELIL ANLLLNQENH PLTVYIYVLL TNLRNVSWNQ FAAAALIFAL
PVVVMFLLAQ NYVRSGLTMG GLKE