Gene Pars_1423 details


Gene Information

Locus tag: Pars_1423
Symbol:
ID: 5055966
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 1283732
End bp: 1284946
Gene Length: 1215 bp
Protein Length: 404 aa
Translation table: 11
GC content: 59%
IMG OID: 640468964
Product: citrate transporter
Protein accession: YP_001153633
Protein GI: 145591631
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG1055] Na+/H+ antiporter NhaD and related arsenite permeases
TIGRFAM ID:
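
The gene and protein lengths in this record follow from the start and end coordinates. The sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Coordinates copied from the Gene Information fields of this record.
START_BP = 1283732
END_BP = 1284946


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature whose start and end coordinates are both inclusive."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS: codon count minus the terminal stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # prints: 1215 404
```

Both values agree with the Gene Length (1215 bp) and Protein Length (404 aa) fields.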


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.418451
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value: 0.0399282
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGGCCA GGCCGCTGTA CCCCAAGCTA CCCACGTGGT CTCTTATGTC CCTCGCCGCT 
TTTATAGCGG TGTTTTTTGG GCCCCTGGGC GTAGACGACG TGCCGCGGGT TGTAAACTTC
GAGGTGCTGC TCTTCCTAAT AGGCATGTTC TCCATAGTCG CCTTGGCCGA GTCAAGCGGG
TTGTTGGACG CCTTCGCCTA CTGGTTCGTC TCGCTACTCA GATCAAGGCT GTCCATATTC
GTCGGGAGCT CGTTGCTCTT CGGCCTCCTC TCGGCCATAG CCGTAAACGA CACAGTGGCC
CTCATGGGCC CCCCACTCGC CGCCGCAGTG GCTAGAGCCG CAGGCATTGA GTACAGGCAC
ATGTTCCTCC TCCTGGCCTT CTCGCTGACC ATAGGCTCCG TGATGACGCC CATAGGCAAC
CCCCAGAACA TGCTAATCGC AGTGGAGTCC GGCATGGCTA CGCCGTTTAT CACCTTCCTC
CGCCACCTCG CCATCCCCAC GCTGATAAAC CTCGTGGCCA CCCCCCTCCT CCTGTTTAAG
CTATTCGGCA TAAAGAACGA GAAAGTGCGC TACGTCGCTG TGGCGCGGGA GCACATAAGG
AACAGACGCG ACGCCGCGGT GGCCGCGGTG GTGTTGTTAG GCACCGTAGC CGCAATACTG
GCCAACGACC TCGCCGCGCT CTCGGGGCAC CCCCACATCA AGAACATCGG CTTCATCCCC
TTCGTCGCCG CGTCTCTGCT CTACTTCTTC GCCACCTCGC CCAGGGAGGT TCTGGCCAAG
GTCGAGTGGG GCACAATCAT CTTCTTTATC GCGATGTTTA TTACAATGGC TACTATCTGG
CATGGCGGCG TGTTGCAGCC CCTCACCTCC GCCTTGCTCC CCAGCTACTC CGGCTCTGCC
CTGGATCTTT TGGCCATCAC GGCCTTGTCC ATTGCGCTGA GCCAAGTTCT GAGCAACGTG
CCTTTTGTGA GCTTGTTCTC CACGTATCTC CACGAGCTGG GAGTCGCAGA CCCTAAGGCT
TGGGTCGCCC TCGCCATGGC CTCCACAATT GCCGGCAATC TCACCCTCCT AGGCGCCGCC
TCAAATATCA TAATCCTGGA GGTGCTCGAA ACCAGGTTCG GCGCCACCAT AACTTTCCTC
CAATTCTTGA AATACGGCGC CTTAGTAACC GCCCTAAACC TCGCCGTCTA CCTACCGTTC
CTCCTACTTG CATGA
 
Protein sequence
MLARPLYPKL PTWSLMSLAA FIAVFFGPLG VDDVPRVVNF EVLLFLIGMF SIVALAESSG 
LLDAFAYWFV SLLRSRLSIF VGSSLLFGLL SAIAVNDTVA LMGPPLAAAV ARAAGIEYRH
MFLLLAFSLT IGSVMTPIGN PQNMLIAVES GMATPFITFL RHLAIPTLIN LVATPLLLFK
LFGIKNEKVR YVAVAREHIR NRRDAAVAAV VLLGTVAAIL ANDLAALSGH PHIKNIGFIP
FVAASLLYFF ATSPREVLAK VEWGTIIFFI AMFITMATIW HGGVLQPLTS ALLPSYSGSA
LDLLAITALS IALSQVLSNV PFVSLFSTYL HELGVADPKA WVALAMASTI AGNLTLLGAA
SNIIILEVLE TRFGATITFL QFLKYGALVT ALNLAVYLPF LLLA
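
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is a minimal standard-library version, assuming the amino acid assignments of NCBI translation table 11 (identical to the standard code for elongation codons) and a gene that starts with ATG, as this one does. `CODON_TABLE` and `translate` are illustrative names, and only the first and last blocks of the gene sequence are included to keep the example short.

```python
from itertools import product

# Codons enumerated in NCBI order (T, C, A, G at each position) paired with
# the amino acid string shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon.

    Whitespace (such as the 10-base block spacing used in this record) is
    ignored. Codons with unexpected characters are reported as 'X'.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 and last 15 bases of the gene sequence in this record.
FIRST_CODONS = "ATGTTGGCCA GGCCGCTGTA CCCCAAGCTA"
LAST_CODONS = "CTCCTACTTG CATGA"

print(translate(FIRST_CODONS))  # prints: MLARPLYPKL
print(translate(LAST_CODONS))   # prints: LLLA
```

The two outputs match the first ten and last four residues of the protein sequence listed above. Passing the full gene sequence to `translate` would allow the same comparison over all 404 residues.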