Gene Pars_0200 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPars_0200 
Symbol 
ID5055943 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePyrobaculum arsenaticum DSM 13514 
KingdomArchaea 
Replicon accessionNC_009376 
Strand
Start bp179822 
End bp181039 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content59% 
IMG OID640467779 
Productmajor facilitator transporter 
Protein accessionYP_001152467 
Protein GI145590465 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones29 
Fosmid unclonability p-value0.661715 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGACTCG CCGCTAGCTA TGTTCCCATT TTAGTTGCCC GTATTGGGTC GGGGGCCTCT 
GCGTTTCTGG TAGTCAAGCT GGCAAGCGGG GGCGCCGTGG AGGCGGGCTT CGTCCTCGCC
GCCTATCCGT TTCTAGAGGC GCTGGGAGCC TTCGTGGCGG GTCGTTGGTC GGATACCCTG
GGCAGAAAAA CAACGCTGAT TATGGGCTAC GTGGTGAGGT CGATTGCGAT GTTGGCTCTG
GCGTGGGCTT TCTACACGCA TGAAGCCCCG TGGCTGGAGG CGTTTCTAAA CGGGGTAATA
GGCTTCACCA CCGCGTTTAT CCTCACGTCG TCGCTCGCAA TGGCCACAGA CCTCACAGAG
GTGAGAAATA GGGGGCTGGG TATGGGAGGT TTTGAGTTCA TAAACCTGGG GAGCTACGGC
GTGGGCTACC TCTTGGGCTC TGCCTTGTAC TCCATTTTTC AGGACCCATC AGCGTATTTA
GCGGTGGCGT TGTTCACCAC AGTTGCTATT CCAGTATTCG CAAAGTACAT AGAGGAGACG
AGACCAGCGG CGCCTGGGGA GGGGAGGCTC TTGCTCTCGG TACTGCCGCC TTCGGCGGTG
GCTCTTCTCC CCGTGTGGTT TGCCTTAACA ACGATAATAG GGCTTGCGAT GTATTCGCCA
AGAATTTTAA GAATAGAAGG GGGCAACCTC GGCGTGGCGG GGCACATCGT GCAGATGCTC
GGCGGTCACC TGGCAATCGG CCTCTTGTTT ATCAGTGCCT TGGCTTTGCT GGGGCTGGGC
GCAATATTCT TCGGTAGGCT GGCCGACAGG TGGGGGCGGC TGAAGACCTT TAGGCTGGGG
CTGATAGGCG GCCTCCTCGC CCTTGTAACG CTAAACGTTG CGCTACGCCT CAGCCTAGGC
GTCGTTGAGG CAGTCGCAAT CACGGCCCCC CTGCTGTTCC TAACCTCGGC TATTGGACCC
TCGATCTTGG CCATGATCGG CGACGAGGCC GATATAAGGT ATAGGGGGAC TGTCATGGGG
ATATACAGCG TTATGCTAGG GCTTGGGATC GGCTTCGGAA GCCTTCTAGG GGGCTTCGTG
GCCGCCGCGT TTCCGCAATA TGAAATAAAC GGGCTAGCCG CCGCGGCGCT CGGCGTATAC
GCCACAATGG CGGCGCTCCA CTTGGTCGTA GCTAACACAT CCGCCGGGAA GAGGGGGCTA
GCGCTAGAGA AGGGGTAG
 
Protein sequence
MRLAASYVPI LVARIGSGAS AFLVVKLASG GAVEAGFVLA AYPFLEALGA FVAGRWSDTL 
GRKTTLIMGY VVRSIAMLAL AWAFYTHEAP WLEAFLNGVI GFTTAFILTS SLAMATDLTE
VRNRGLGMGG FEFINLGSYG VGYLLGSALY SIFQDPSAYL AVALFTTVAI PVFAKYIEET
RPAAPGEGRL LLSVLPPSAV ALLPVWFALT TIIGLAMYSP RILRIEGGNL GVAGHIVQML
GGHLAIGLLF ISALALLGLG AIFFGRLADR WGRLKTFRLG LIGGLLALVT LNVALRLSLG
VVEAVAITAP LLFLTSAIGP SILAMIGDEA DIRYRGTVMG IYSVMLGLGI GFGSLLGGFV
AAAFPQYEIN GLAAAALGVY ATMAALHLVV ANTSAGKRGL ALEKG