Gene Pars_1503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_1503 
Symbol 
ID5055865 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp1361727 
End bp1363175 
Gene Length1449 bp 
Protein Length482 aa 
Translation table11 
GC content60% 
IMG OID640469045 
Productextracellular ligand-binding receptor 
Protein accessionYP_001153711 
Protein GI145591709 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.128913 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones41 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTTATAA ATCAGTATTT GCGCCACTCC ATGAGGTGGA TAATCCCATT GGCAGTGCTC 
CTAGTGGTAA TACTTGGAGT TGTGGCGTTT TTGCTAATGT CTCCCCCGCC CTCTGCCGGG
GGCACCACGT CGCCCACGCC TACCTCCACA TCTCAACCCC CGGCAACGAC GCCGAAACCT
ACCACGACGA CACAGTCGCC GACTACTACC ACAACAACGG CGACTACCAC CCAGACGGCG
CAGCCCACTT ACAAGTGGCG GCTCGTCGGC ATGTACATCG CCTCAGAAGA CAGGGTGGTT
ACTGGTGACA TACCCGCCCA GAAGCCGCCC AGCGGCTACC TCACGCCTGT TATCGTCAAC
GCGACCCCCA CAGAGGCCAC TACGGTGGTT GAAATTGGAG TGCTGCAACC GCTCAGCGGT
CGCCTCGCCT CTCTGGGAGA GCTCTCAGCC GCGGCGGCCC AACTGGCGGA GCAGGACGTG
AATAGGTACT TGTCCAGCAT AAACGCCCCA TTTAGGGTGA GGGTTGTGGT CGCCGACACG
GCGGCTGACC CCACAAAGGC GCTGGACCAG ATGAAGGCTC TGCACAGCAG AGGCGTGAAG
TTCTACATCG TGAGGACGTC GGGCGAGGTG AGGGCGATGA AGTCCTACGC CGACGAGAAC
AAGCTGTTGA CCATTTCCGT GTCCTCCACG GCCCCGGCCC TCGCAATACC GGGCGACTAC
GTCTTTAGGC TACCGCCAGA CGACAACAAG CAAGTGAGGG CTATTTCAAA AATTATCCAA
GACAGCGGGG TGAAGGCTGT GGTGGCTATT TGGCGCAACG ACGACTGGGG CAACGGCCTA
GTTAGGGGGC TTGAGAACAT GAGCCGCGCG AGGGGGTTTG AGGTGATTAG GGCTGCCTCG
TACGACCCGC AGAAGGGCGA GTTCTCAACT GAGGTGGGGG TCTTGGCTAG GCTGGTTAAA
GACGCCATTG GCAAGTACGG CGCTGATAAG GTGGCGGTGG TTGCCTTCGG CTTCGCGGAG
CTCCAGACGA TTTTCCTCAC CGCGAAGAAC TATCCTGAGC TTAGATCTGT GAAGTGGTTC
GGCGCCGACG GCTCTACGGG GCTGTCTGAG CTTCTCGTCC CAGACGCGGC GGAGTTCGCG
GTGTCTGTGG GCGGCTTCGT GAGCCCCAAG TTCGCGCCTG CGAGGAGCCC CTACTACGAG
AGGGTGAGGA GCTACATTTT GGAGAGGTAT AAGAGGGAGC CCGACTCCTA CGCCTACAAC
GCCTACGACG CGGTTTGGCT CATTACGTAC TCCATACTCA AGGCCGGCTC TGCCGACAGC
GAGGCTGTCT GGCGGGTGTT CCCCCAGGTG GCGGCGAATT ACTTCGGCGC TTCTGGCTAC
ACTAAGCTCA ACGAGGCTGG CGACAGGGAC AGCGCCGATT ATGAGATCTG GGCCATTGTT
AAGGGCTGA
 
Protein sequence
MFINQYLRHS MRWIIPLAVL LVVILGVVAF LLMSPPPSAG GTTSPTPTST SQPPATTPKP 
TTTTQSPTTT TTTATTTQTA QPTYKWRLVG MYIASEDRVV TGDIPAQKPP SGYLTPVIVN
ATPTEATTVV EIGVLQPLSG RLASLGELSA AAAQLAEQDV NRYLSSINAP FRVRVVVADT
AADPTKALDQ MKALHSRGVK FYIVRTSGEV RAMKSYADEN KLLTISVSST APALAIPGDY
VFRLPPDDNK QVRAISKIIQ DSGVKAVVAI WRNDDWGNGL VRGLENMSRA RGFEVIRAAS
YDPQKGEFST EVGVLARLVK DAIGKYGADK VAVVAFGFAE LQTIFLTAKN YPELRSVKWF
GADGSTGLSE LLVPDAAEFA VSVGGFVSPK FAPARSPYYE RVRSYILERY KREPDSYAYN
AYDAVWLITY SILKAGSADS EAVWRVFPQV AANYFGASGY TKLNEAGDRD SADYEIWAIV
KG