Gene Tfu_2068 details

Gene Information

Locus tag: Tfu_2068
Symbol:
ID: 3581614
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Thermobifida fusca YX
Kingdom: Bacteria
Replicon accession: NC_007333
Strand:
Start bp: 2420965
End bp: 2423022
Gene Length: 2058 bp
Protein Length: 685 aa
Translation table: 11
GC content: 69%
IMG OID: 637685764
Product: putative integral membrane protein
Protein accession: YP_290124
Protein GI: 72162467
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
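
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive (an assumption, though it is consistent with the listed gene length) and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative, not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2420965
end_bp = 2423022
gene_length_bp = 2058
protein_length_aa = 685


def span_length(start, end):
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp):
    """Residues encoded by a complete CDS: codon count minus one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Both checks hold for this record: the span covers 2058 positions, and 2058 bp gives 686 codons, one of which is the stop codon.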


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.0278273
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAACGGCA ACGACCACTA TGACGACCGC CCGCGCCGCG CACGCCGGCC CGACCCGTTG 
ACGGACCCCT GGCCGCCGCC ACGGGATACC GACGCGGCCC CTACCGGGGC GAGGGCCGCC
GACTATGAGG GACGCGGATA CCGGGACGAG TACGGGGTAG GGCACCGGCG CGGCACGGAG
GTCCCCCGAC AGGACGGCTT CGCATGGCAG CCCTCCTCCG CCACGCAGAC CACCGCTCCC
ACCGGGGGAG CGGAACCGCT GGACGGGACG GAACGCCCCC GCGGACGGCG GAGACGGCCG
GGGCCCGCGG ACACCGGCTC CTTCCCGTTC CCGGTGTCGG AAGCCGAAGC CTTGCGTGAG
CAGCCGCGCG GACGGCGCCG TAGACCCGAG AGCGACGAAT CCGCCCGCTG GGCCCCCGAG
TCGGACCCGC TATTCGACGG CAGGGGGGCA CAGCCCGCAG GGCCGCGGTC GGCAGAACGC
TACGACCAGC CGCCCGACCC TTTGACGGCT TCGGCGTATG CGGGGCGCCG CGCCAGGTCC
GCCCACCCCG CGGAATACCC GCAAGGGCGG GGCACCACCC CGCTCCGCGA CGAGCAGCGG
CCCGGTACAG CGTGGTCGGA TGATCCGCTG TGGTCCCCTG CCGACCCCGG CAGCCGCAGA
GAAAACGACG ACATCCTGGA CGGCCGCACC CGACGGCGGC AGCCGAGCGC CGACGAGGAC
CGGGACATCC CCAGACGAGG CCGCCGAGCC CGTCCTCCCG CTTTTGACGA AGCGGACAGC
GCCCCGCCGC GGGGACGCCG CTCCCGTACC GTCTCCCCGC AGACCGGGGA GACACCGACC
GCCACGGCGA GCCTAAAGAC ACGCGAGACC GATCTCGACG ACGAAGACTC TGACGCACTC
CCGTTCCCGT TCACCGAGGA CGAAGACGAC GAGGCCCCGC CCCGGCGCTC CCGCAGGGGC
CGCCACAGCG GCCGACGGGC TGCCGGTCGC CGAGCCAAGA GCAGACGGCA GCGGAAGAAG
AGCAAAGTCG CACTCGCCAG CGCCCTGATC GTGCTGAGCC TGTTCCTGGT GTCCGCCGGG
ACAGGCGGCT ACCTGCTCCT GCGTACCTAC ATCATTCCTC CGGACTATTC CGGGGAAGGC
AACGGCGAAG TCGACATCGT CATCGAGGAA GGCGACTCCG GAACCGTGAT CGCGGAGAAG
CTGCACCAGG CCGGGGTGAT CGCGAGCGTG CGCGCGTTCA CCAACGAAAT CCGCTTCTCC
GACATCAACT TCGTGCCCGG CACTTACCGG ATGCGGCTGG GGATGAGCGC GGAGGCGGCG
GTCGCCCTGC TGCTCGACCC GGAGAGCCGT ATCGCGCTGA ACGTGACTAT TCCGGAAGGA
CTGCGCGCCG AGCAGATCCT GGACCGGCTG GCTGAGCAGA CCGGAATCCC CCGGGAAGAG
TTCCAGGAGG CCTACGAGGA CCACGAGTCG CTGGACTTGC CCGAATACGC CACTCAAGGG
CCGGAAGGGT ACCTCTTCCC CGAGACCTAC GAGTTCGACC GCAGCGCCTC TGCGACTGAG
ATCCTCCAGC AGATGGTGGC GCAGTACCGG AAGGTCGCTG CCGAGATCGA CCTGGAGAAC
CGGGCAGCAG AGGCAGGATT CGACCCGAAC GAGATCATGG CGATCGCCGC GATCGTGCAG
GCCGAGTCCG GAAAGATCGA GGACATGGGG AAGGTCGCCC GGGTCATCTA CAACCGCCTG
GACGACGGCA TGTACCTCAA GATGGACAGC ACCTGCTTCT ACGCCCTCGG CGAGTACGGC
ATCGCGATCA ACCGGGACCA GCAGGACCGG TGCCGCAACG ACGAGACCGG ATACGACACC
TACTTCCACG AAGGACTCCC GGTCGGCCCT ATTGTCAGCC CTGGTAAAGA TGCCATCGAA
GCTGCGCTCG CCCCGGAGGA GGGGCCCTGG CTGTTCTTCG TGACCACCGA CCCGGAGAAC
GGGGTCACGA AATTCACGGA CAGCGAAGCC GAGTTCTGGG AACTGGTCAA CGAGTTCAAC
CAGAGCCAGA GCGGCTGA
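
The gene sequence is printed in blocks of ten bases, six blocks per line, so it needs to be joined before any analysis. As a minimal sketch, the helpers below strip the whitespace and compute a GC percentage that could be compared with the GC content field of the record. The function names are hypothetical, and only the first line of the sequence is used here for brevity.

```python
def clean_sequence(block):
    """Join a whitespace-separated sequence block into one uppercase string."""
    return "".join(block.split()).upper()


def gc_percent(seq):
    """Percentage of G and C bases in a nucleotide string."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First line of the gene sequence above; paste the full block to check the whole gene.
first_line = "ATGAACGGCA ACGACCACTA TGACGACCGC CCGCGCCGCG CACGCCGGCC CGACCCGTTG"
seq = clean_sequence(first_line)
print(len(seq), round(gc_percent(seq), 1))
```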
 
Protein sequence
MNGNDHYDDR PRRARRPDPL TDPWPPPRDT DAAPTGARAA DYEGRGYRDE YGVGHRRGTE 
VPRQDGFAWQ PSSATQTTAP TGGAEPLDGT ERPRGRRRRP GPADTGSFPF PVSEAEALRE
QPRGRRRRPE SDESARWAPE SDPLFDGRGA QPAGPRSAER YDQPPDPLTA SAYAGRRARS
AHPAEYPQGR GTTPLRDEQR PGTAWSDDPL WSPADPGSRR ENDDILDGRT RRRQPSADED
RDIPRRGRRA RPPAFDEADS APPRGRRSRT VSPQTGETPT ATASLKTRET DLDDEDSDAL
PFPFTEDEDD EAPPRRSRRG RHSGRRAAGR RAKSRRQRKK SKVALASALI VLSLFLVSAG
TGGYLLLRTY IIPPDYSGEG NGEVDIVIEE GDSGTVIAEK LHQAGVIASV RAFTNEIRFS
DINFVPGTYR MRLGMSAEAA VALLLDPESR IALNVTIPEG LRAEQILDRL AEQTGIPREE
FQEAYEDHES LDLPEYATQG PEGYLFPETY EFDRSASATE ILQQMVAQYR KVAAEIDLEN
RAAEAGFDPN EIMAIAAIVQ AESGKIEDMG KVARVIYNRL DDGMYLKMDS TCFYALGEYG
IAINRDQQDR CRNDETGYDT YFHEGLPVGP IVSPGKDAIE AALAPEEGPW LFFVTTDPEN
GVTKFTDSEA EFWELVNEFN QSQSG