Gene Tneu_1397 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1397 
Symbol 
ID6165056 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1240934 
End bp1242208 
Gene Length1275 bp 
Protein Length424 aa 
Translation table11 
GC content60% 
IMG OID641668552 
ProductUbiD family decarboxylase 
Protein accessionYP_001794768 
Protein GI171185849 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0043] 3-polyprenyl-4-hydroxybenzoate decarboxylase and related decarboxylases 
TIGRFAM ID[TIGR00148] UbiD family decarboxylases 


Plasmid Coverage information

Num covering plasmid clones25 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones44 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCGCTGG CCGACGTCGT AAGAAAGCTC TCAAACGTCC AAGTCTACGG CCCTCCCTAC 
GGCGAATTCA CGGTAGCCCG GATCCTTAAG GAGTCGGAGG GCCGGTGGAC CCCCGTCTTT
GAGAGGGTGG GAGAGGGCTT CACCGGCGTT GGAAATATCG TAGATACAAG GGAAAAGCTG
TACTCCTACC TAGGCGTAAA AAGCGACGAG GAGGCTTATA CGAAGTTGTT AAACGCGGAG
GAGAACCCGG CAAATCTAGA GACCGCATCT ACGTGGAGGG ACATGTATAG GGAGGTCGAG
TCTCTGTACG ATCTGCCTAT GGTGCGGTAT TACGAGAGAG AGGCGAGGCC GTACATCACC
TCAGGCGTTG TGGTAGGGGT GGGCCAAAGC GGTGTTATGA ACGCCTCTAT TCACCGGTTT
TCTCCCATTG GGGCGAGGCG GGCGGTGGTG AGGTTGGTCC CGAGGCACCT CTACCAGATC
TACAGGTCCG CTGTGGAGAA AGGCGTGGAG GTCCCCGTGG CGGTGGCCTG GGGGGTACAT
CCACTGGTTC TGCTGGCCGC CGCCTCCTCG CCCTCCTTCG GCGTGTTTGA GCTCGGCGTT
GCAGCCCGCC TCATGGGGGG GCTGAAGGTC GTCGAGTTGG AAAACGGGGC TGTGGCGCCG
TTTATGGCGT CGGTGATAGT GGAGGGCTAT CTGACGCGGG ATATGGCCGA GGAGGGGCCC
TTCGTGGATA TAGTCGGCGT TTACGACAGG GTGAGGCTCC AGCCAGTGGT GCGGGTGGAG
AGGATCTACG TGCTGCGGGA GGGGGCCTAC GTCCACTACC TGTTGCCGGC TGGACTTGAA
CACATGTTGC TCATGGGGTT TGAAAAAGAG GCGAGGATAT GGAGAGCCGT GAGGTCCGTG
GCGCCTAGGG TGAGGAAGAT CAGGTTGACT AGAGGAGGCT TCGGCTGGCT TGTAGCCGTG
ATTTCTCTGG AGAAGTCCGT AGAGGGCGAT GCGAAAAACG CGCTTCTAGC GGCGTTTGCG
GCTCACCCCA GCCTAAAAAT CGCGATTGCG GTAGACAGCG ACGTGGATCC GGAGGACCCC
GTCGCCGTTG AGTGGGCGGT GGCGACGAGG CTCCGGGCCG ACAGGGGTTT GTTCGTGGTG
CCGTACGTCA GAGGGTCAAC TCTCGACCCG GTGGCGTTAA ACGAGGAGGG CTTGACGCAT
AAGGTGGGCA TAGACGCCAC AAGGCCCCTA GATGCAGATC CTTCGTTGTT TGAGAGGGCG
AGGATACCAG GTTAA
 
Protein sequence
MSLADVVRKL SNVQVYGPPY GEFTVARILK ESEGRWTPVF ERVGEGFTGV GNIVDTREKL 
YSYLGVKSDE EAYTKLLNAE ENPANLETAS TWRDMYREVE SLYDLPMVRY YEREARPYIT
SGVVVGVGQS GVMNASIHRF SPIGARRAVV RLVPRHLYQI YRSAVEKGVE VPVAVAWGVH
PLVLLAAASS PSFGVFELGV AARLMGGLKV VELENGAVAP FMASVIVEGY LTRDMAEEGP
FVDIVGVYDR VRLQPVVRVE RIYVLREGAY VHYLLPAGLE HMLLMGFEKE ARIWRAVRSV
APRVRKIRLT RGGFGWLVAV ISLEKSVEGD AKNALLAAFA AHPSLKIAIA VDSDVDPEDP
VAVEWAVATR LRADRGLFVV PYVRGSTLDP VALNEEGLTH KVGIDATRPL DADPSLFERA
RIPG