Gene Tneu_1401 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1401 
Symbol 
ID6164974 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1244667 
End bp1245716 
Gene Length1050 bp 
Protein Length349 aa 
Translation table11 
GC content57% 
IMG OID641668556 
ProductDNA primase large subunit 
Protein accessionYP_001794772 
Protein GI171185853 
COG category[L] Replication, recombination and repair 
COG ID[COG2219] Eukaryotic-type DNA primase, large subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones47 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCATGCA ACGTTTCTCT TGGAGAACTG GCGTGCTTCT TCCCGTTTCT GGACAAATCG 
GCGGCGTATC TATCGAGGAG GGGCCTCCTC CTAGATGCCG TTTTAGCCGA CGGAGCTCTT
GTGGAGAAGG CGCTTGAGCG ACTAAGGACC TCGCTGGAGC GCCGCTTGCC TAACCCACGT
CAGTGTCTAG ACGCTCCAGA CGCAACGGCG GCCGCGGCCA GGCTCGCCTT GTATATAGCC
GCCGCCACGA AAAACCCACA CGTCTTGCGG AGGTTTGCAG ATTCCGAGAG CAAGAACTTC
ACCGCGCTCC TAAGGAAGGT TCCCGGCATA CAGGACCCGA GGTGTAAGAT AGAGCTGGCA
AGAGATCTGG GCGTAAACGC GAGGCCAGCA GGCGAGGTAG CAGCGGGCAT CATCATGGCG
GTGTACAACT CGCCTGTAGC GGTGAGGTGG ACGTCGTATC TTAGATATGC GCCGCAAGAC
CCTTACTGGG CCATGATTAA TAGGCCCGTC GTGAGGGGAT GGGTGATCGT GCCCATGGAG
GACTTCGAAA GGCTTCTCGA GGAGTCGTAT GAAGAGCGTA TACTTCAGAT AGTTAGAGAA
AACGAGCTAG CCGTCGGAAA AGTCGCCATG TCCATAGACC TAACCCAGCT AGAGGATGTA
ATAAAGAGGT ACTCCCAACG CCCAGTGCTA CAAGCGGGGC AAGCTACGGG AGGCGCCGAC
CCCCCCTGTA TGGAGGCGAT ACTCGCGGCG CTCAAGAAGG GGGAGAACCT GCCACACACC
GCCAGATTCG CCATCACGAC CTACCTAATC AAGAGGGGAT GGGACGTGGA GCAGATAGTC
GAGTTGTTCC GCGCCTCACC TGACTTCAAC GAGAAGATAA CGAGGTACCA AGTTATGCAC
ATAGCCGGCC AAGCAGGAGG TAGAAAGGAG TACGCCGTGC CCAGCTGCGA AACCATGAAC
TCGTGGGGCT TGTGCCCCAC CAACCTCAGA TGCGGCGTTA AAAACCCGCT TCAGTATGGG
AGAAGACTTG CAGTTAAAAA GAGTAGTTGA
 
Protein sequence
MACNVSLGEL ACFFPFLDKS AAYLSRRGLL LDAVLADGAL VEKALERLRT SLERRLPNPR 
QCLDAPDATA AAARLALYIA AATKNPHVLR RFADSESKNF TALLRKVPGI QDPRCKIELA
RDLGVNARPA GEVAAGIIMA VYNSPVAVRW TSYLRYAPQD PYWAMINRPV VRGWVIVPME
DFERLLEESY EERILQIVRE NELAVGKVAM SIDLTQLEDV IKRYSQRPVL QAGQATGGAD
PPCMEAILAA LKKGENLPHT ARFAITTYLI KRGWDVEQIV ELFRASPDFN EKITRYQVMH
IAGQAGGRKE YAVPSCETMN SWGLCPTNLR CGVKNPLQYG RRLAVKKSS