Gene Pars_1896 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1896 
Symbol 
ID5055511 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1702159 
End bp1703229 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content49% 
IMG OID640469445 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001154099 
Protein GI145592097 
COG category[E] Amino acid transport and metabolism
[P] Inorganic ion transport and metabolism 
COG ID[COG0601] ABC-type dipeptide/oligopeptide/nickel transport systems, permease components 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.881035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones43 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTTCTC TAGCCAAGAC TCTCCTTGTT AAGGCCATAA CCCTGGTCGT GGTTTTGATA 
GGCGTGTTAG TGCTTCTAGC AGTAATAATG GGAGCCACCG GCCTCTCTGA TAAAATGCTA
AACTCGATCC TCACGTCGGA GGTGCAGGAG TATAAACAAC AACTTTTAAG ACAGGGCAAA
GACATAGCCG CAATAGAAAA GGCTGTTGAG GAGTTTAAAA AAGAAAGAGC GGCGGCGCTG
GGAATAGACA GGCCTTGGTA CGAGAGGATG CCCCAGTTGA TCTACCGCCT CCTCGTCTTG
GATCTCGGCA ATTCAAGAAC GTTGCAGTCG TCGTGGGGGT CTAACAGAAT AGCCGATCTT
ATACTAGATC GACTGCCCAA TACCATAATT TTGACAACTA CCGGCATTTT ACTCACAGCG
CTTGTAGGCA TATGGATGGG GCTGTACATG GGGGCTAATA TTGGGAGTAG AGCCGATAGG
GTCGTGTCAA TTTTGTCGGC GGCGTCTTAT GCCTTGCCGC TGTGGTTTGT TGGTCTTGTC
CTGATACTAG TGCTTGCCTA CGGCCCCAAG ATACTGTGGG GCGTCCAGAT ATTTCCGCCG
GGAGGCATGG TATCTACGCC TCCGCCGAAG GAGCCTCTGG CCTATGTATT AGACGTAATG
TGGCACCTTT CCCTACCTCT GCTTGCCTCC TTTATTGTCT TTTTTGGGAG CTGGGCCTAC
ACTACTAGGA ACATTGTCTT CAGCGTATCA CAGGAAGACT TTGTTAACTT TGCCAGGGCA
AAGGGACTGC CGGAAGATAT GGTAAGAAAC AGGTACATAC TTCGTCCCTC ACTTCCACCC
ATCTTGACTA ACCTAATTCT GAGCCTCGCG GCCTCGATCT CGGGGTATAT CATTACGGAG
AGGGTTTTCA ACTGGCCGGG CATGGGGTCG CTGTTCTACG CCGCAATAAC GGCACTCGAC
GAGCCGGTAA TCTTCGCATT GACATACGTC TTTACGCTTG TGTATATAAT AGGGAGGTTT
ATACTAGAAA TACTCTACGT CTTACTCGAC CCCCGAATTA GGTTATCATG A
 
Protein sequence
MASLAKTLLV KAITLVVVLI GVLVLLAVIM GATGLSDKML NSILTSEVQE YKQQLLRQGK 
DIAAIEKAVE EFKKERAAAL GIDRPWYERM PQLIYRLLVL DLGNSRTLQS SWGSNRIADL
ILDRLPNTII LTTTGILLTA LVGIWMGLYM GANIGSRADR VVSILSAASY ALPLWFVGLV
LILVLAYGPK ILWGVQIFPP GGMVSTPPPK EPLAYVLDVM WHLSLPLLAS FIVFFGSWAY
TTRNIVFSVS QEDFVNFARA KGLPEDMVRN RYILRPSLPP ILTNLILSLA ASISGYIITE
RVFNWPGMGS LFYAAITALD EPVIFALTYV FTLVYIIGRF ILEILYVLLD PRIRLS