Gene Tneu_0420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_0420 
Symbol 
ID6166066 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp377649 
End bp379538 
Gene Length1890 bp 
Protein Length629 aa 
Translation table11 
GC content62% 
IMG OID641667578 
ProductAMP-dependent synthetase and ligase 
Protein accessionYP_001793814 
Protein GI171184895 
COG category[I] Lipid transport and metabolism 
COG ID[COG0365] Acyl-coenzyme A synthetases/AMP-(fatty) acid ligases 
TIGRFAM ID[TIGR02188] acetate--CoA ligase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value0.0761365 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCGCGG AGTTCGTAGA GGTATACAGG AAGTCGCTGG AGGACCCCAT CGGCTTCTGG 
GAGAAGCAGG CGGAGAGGCT GTACTGGAGG GAGAGGTGGG AGAAGACCTA CGACGACTCC
AACCCCCCCT TCTACAGGTG GTTCGTAGGC GGCAAGACCA ACATCTCCTA CAACGCCCTA
GATAGGCACG TTAAAGGCGG GAGGGCCAAC AAGGCGGCGT TGATCTGGGT CTCGGCAGAC
GGCGCCACCA GGGTGCTCAG GTACTGGGAC CTCTACAGGG AGGTCAACCG CTTCGCCGTG
CTTCTGAAGA GCCTCGGCGT AGAGCGGGGC GACAGAGTGG CGATATACAT GCCGATGATA
CCCGAGGCCA TGGTGGCGAT GTTGGCCGTG AACAGAATAG GGGCTGTGCA CACGGTGGTC
TTCTCCGGCT TCGGCCCCCA GGCGCTCGCC GAGAGGATAA AAGACGCCGA GGCCAAGGTG
GTGATAACCG CAGACGGCAT GAGGAGGCGC GGCAGGGTGA TCCCCCTGAA GCCCACGGTA
GACGAGGCGC TGAAGATAGT GGGCAACGAC ATATTCACGG TGGTGTACAA ACACACGGGG
GTCGAGGTCC CCATGAAGCA GGGCAGAGAC CTCTGGTGGC AGGAGGAGAT AGCCAAGATC
CCCCCAAACA CCTACATAGA GCCCGAGTGG GTGCCCGGGG AGGCGCCGCT CTTCATACTG
TACACCTCCG GCACAACCGG CAAGCCGAAG GGCATACTCC ACCTACACGG CCAGTACATG
GTGTGGATCT GGTACGCCTT CAACCACCTC ACCGGAGCCG AGAGGGACTT CAGAGAGGAC
ATAGTCTTCT TCTCCACAGC AGACATAGGC TGGATCTCCG GCCACCACTA CGGCGTCCAC
GGCCCCCTCC TCAACGGCCT GACCGTCCTC TGGTATGAAG ACGCCCCCGA CTACCCCCAC
CCCGGCATCT GGTGGGAGAT CGCCGACACC TACAAGGTCA CCCACATGTT GTTCTCCCCC
ACCGCCATCA GGCTGTTGAT GAAATACGGC GACGAGTGGC CCAGGAGGTA CAAGCTAGAC
AGCATAATGG CCCTCTACCC CACCGGCGAG GTCCTCAACG AGGAGGCCTA CAACTGGATG
AGGCGGGAGG TATGTAGGGG GAGGCCCGAC TGTCAGATAG CCGACATATG GGGCCAGACC
GAGACCGCCT GCTTCGTCAC AGCCCCCGGC TCCATGAACC TAGGCGGCTT CCGCTACAAA
TACGGCTCGG TGGGCATGCC CTACCCCACC CTCAACCTGC AGATCCTAGA CGACGATGGG
AAGCCGCTTC CGCCCGGCGC CAAGGGACAC GTGGTGGCCA AGCCTCCGCT GCCCCCCGCC
TTCCTACACA CCCTGTGGCG CGACCCGGAG AGATACGTCA AGTCCTACTG GTCCCGCTTC
CCAGGCTACT ACTACACCGG CGACCTCGGC TACATAGACC AAGACGGCCA CCTCCACATA
ATGGGCCGCT CCGACGACGT GATAAAGGTG GCCGGCCACA GGCTCTCCAC CAGGGAGGTG
GAGGACATAC TCACCAGCCA CCCCGCCGTA GCCGAAGCCG CCGTGGTGGG CGTGCCAGAC
GAGGTCAGAG GCGAGGTGCT GGGGGTCTTC GTGGTGCCCA AACAAGGCAT GAAAATCACG
GAGGAGGAGG TGGTTAAACA CCTCAGGAAC TCCCTCGGCC CCGTGGCGGT CATTGGAAAA
GTCGCGATAC TGGATAAGCT CCCCAAGACC AGGACAGGCA AAGTCATGAG GAGGGTGCTG
AGGGCCATGG CCACCGGGCA ACCCGTAGGC GACCTAAGCA CCCTAGAAGA CGAGGAGGCC
CTGGAGGAGC TAAGGAAAAA ACTCGGCTAA
 
Protein sequence
MSAEFVEVYR KSLEDPIGFW EKQAERLYWR ERWEKTYDDS NPPFYRWFVG GKTNISYNAL 
DRHVKGGRAN KAALIWVSAD GATRVLRYWD LYREVNRFAV LLKSLGVERG DRVAIYMPMI
PEAMVAMLAV NRIGAVHTVV FSGFGPQALA ERIKDAEAKV VITADGMRRR GRVIPLKPTV
DEALKIVGND IFTVVYKHTG VEVPMKQGRD LWWQEEIAKI PPNTYIEPEW VPGEAPLFIL
YTSGTTGKPK GILHLHGQYM VWIWYAFNHL TGAERDFRED IVFFSTADIG WISGHHYGVH
GPLLNGLTVL WYEDAPDYPH PGIWWEIADT YKVTHMLFSP TAIRLLMKYG DEWPRRYKLD
SIMALYPTGE VLNEEAYNWM RREVCRGRPD CQIADIWGQT ETACFVTAPG SMNLGGFRYK
YGSVGMPYPT LNLQILDDDG KPLPPGAKGH VVAKPPLPPA FLHTLWRDPE RYVKSYWSRF
PGYYYTGDLG YIDQDGHLHI MGRSDDVIKV AGHRLSTREV EDILTSHPAV AEAAVVGVPD
EVRGEVLGVF VVPKQGMKIT EEEVVKHLRN SLGPVAVIGK VAILDKLPKT RTGKVMRRVL
RAMATGQPVG DLSTLEDEEA LEELRKKLG