Gene Pars_1101 details

Gene Information

Locus tag: Pars_1101
Symbol:
ID: 5055487
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pyrobaculum arsenaticum DSM 13514
Kingdom: Archaea
Replicon accession: NC_009376
Strand:
Start bp: 986269
End bp: 988323
Gene Length: 2055 bp
Protein Length: 684 aa
Translation table: 11
GC content: 60%
IMG OID: 640468657
Product: glutamate synthase (NADPH)
Protein accession: YP_001153331
Protein GI: 145591329
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0069] Glutamate synthase domain 2
TIGRFAM ID:
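
The start, end, gene length and protein length fields above are related by simple arithmetic. The sketch below is only an illustration, not part of the database record. It assumes that the coordinates are 1-based and inclusive, that the CDS is unspliced (as the record states), and that the gene length includes a terminal stop codon. The function names are hypothetical.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an unspliced CDS of the given length.

    Assumes the length is a whole number of codons and, by default, that the
    last codon is a stop codon that is not translated.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    return codons - 1 if has_stop_codon else codons


# Values taken from the record above.
length = gene_length_bp(986269, 988323)
print(length)                     # 2055
print(protein_length_aa(length))  # 684
```

Under these assumptions the computed values agree with the Gene Length (2055 bp) and Protein Length (684 aa) fields.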


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 26
Fosmid unclonability p-value: 0.338613
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGTCTGT GGAATATATA CCTGAAACCG GCGGTACATA GGGTGCCGGA ATTCTGGAGC 
CACGAGAAGA TAGAGTATAT ACGCCGGTCC TCCCAGGAGG GCATTCCGCC CTACGCCTTG
GAGGAGGCAC CTAGCAAAAT CGGCCGCCTC CTCGACAGGT TGGTCTTCAA GGACCTGCGC
CCCCGCGAGG TCAACGAGCT TTTGAAAAAG GCCGACGGCC TCGACGTGGA TGTAGGCCTC
GACTTCTTCG GCACGAGGCT CACAGCGCCT ATATACCTGG GGGATATGTC ATTTGGAGCG
CTAAGCGGCA ACCCCAACAT AGCTATAGCC AAGGTCTCCA CAGAGGAGGG CATGGTGGCC
GGCATAGGCG AGGGCGGGCT CCACCCCGAA GTGGCGAAGT ACAGAAATAT AGTGGTGCAG
TGGGCCTCGG CGCGTTTCGG CATGGACATG GCGTTGCTTA GGGCGGGCTT GGCGGTGAAT
ATAAAGATAG GCCAAGGGGC GAAGCCAGGT ATAGGAGGCC ACCTCCCCGG GATAAAGGTG
ACGAAAATTA TCGCGGAGCT GAGGAAAATA CCCGAAGGTA GCGAGGCACT TTCGCCGGCA
CCTCACCACG ACATATATTC AATAGAGGAT CTGGCGCAGA GGGTGAAGGC CTTGAGAGAC
CTCACAGGCA AGCCTGTCCT CGTTAAGGTG GCTGCTGTAA ACAAGATAAT GTACGTGGCG
GTTGGGGTAT CGCGGTCCAC AGCTGAGGGC ATAATAATAG ACGGCGCCGG CGCCGGGACG
GGCGCCACGC CCATATCTGT GCGCAACCAC CTAGGCATAC CCATAGACTA CGCAATACCG
GTGGTGGACA AATGGCTCAG GGAAAACGGG GCAAGGAGCA ACTTCCTGGT AATCGGCGGG
GGGATGTTGT ACAGCGCCTC GGACATAGCC AAGCTGATAG CCCTCGGCGC CGATATGGCC
AACATCGGGA CAGCCGCTCT CCTCTCCTTC GGATGCATAA TGTGCCACTC CTGCCACACA
GGCGGGTGCC CCACCTCCCT TACCAACATG ATCGGCGCCA GGCCAGACCT AGATATCGAG
TGGGCCTCCG CCATGTTGAG GAGGTACCTC AGGGCGCTGA GGCTCGGCCT TAAGGCCATC
CTCTACTCCC TAGGCATGGA CAGCCTCAAG GAGCTGGCCG GCCGGAGGGA CCTCCTGGGG
CTCTACTACG CCGACGAGGC CGTGGCCTCC ACAATAGGCG TTGACCTAAT GGCGGAGGGG
GAGATAGCGT ATTTCCAAGA ATACGCGCCG ACGATGCCGA GGGAGGTGTA CGAGGAGGGC
AAGGTCCCGA TAATAGGCAT GGGAGGCGTG GTGCCAGGCT ACACCTACCC CGCCAGGAGG
CCCCTCGACT TGTTGAGGAT AGAAGCGGCG CAGGTAACCC ACCCATCTGT TGACCCGTAC
AGGGAGGAGG TGGACGTGAG GGTTTACGTC AACGGCTTCT CCTACGACAC GCCCATAGTG
GTGCCCGGAC TAGAGCGCGC GGCCGTGATG GCGGGCTACG CCCTCGGCGC GCTGGTAGAC
GGCGCCGGTT GCCCAGAGCC GGAGTACTGC CTATCCACAC TCCCCCACGT CAGGTTGCCG
CCTTCTAGAG ACATAGAGCC GAGGGAGGGC ATCGTAATCA TCGACGAAAG ACTAAAAGGC
GACGCATGGC TGGAGGAAGC CGTGGCGTTG CTGGACGAAA AGGCCAGACA AGCCGGGATT
AGGGAGAAGA TGACCATAAT AGCGGCGGGG CGCCTCTCAA GCGGCGCCGA CGTCTACAAG
CTGGCGGCCC TAGGCGCAGA CCTGGTGGAG CCCCGCGAGG CGTTTGAGCT CCTGGCCAGC
AGGCAGGCCC CCAGCTACGC CGCCAAGAGG CGTTGGTACG AAAACCTCAT ATCAGTCCTC
ACAAAGGAGC TCAAGCTGGC CATGGGCGCG GGGGGAATAA CAGACTACTA CCACATGGTC
GGCAACAAGG ACCTGCTAAG ATCCCTAGAC GGGAGAATAG CCGCCAGGCT CAACGTACCG
GTGGCGGGCA ACTAA
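
The gene sequence above is laid out in space-separated blocks of 10 bases, so it needs to be joined before any downstream use. The following is a minimal sketch, assuming plain Python with no external libraries, of how the blocks could be joined and a GC fraction computed for comparison with the rounded GC content field. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the sequence is shown as input.

```python
def clean_sequence(raw: str) -> str:
    """Join a block-formatted sequence by dropping all whitespace and upper-casing."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence listed above; paste the full listing
# here to obtain the value for the whole gene.
first_line = "GTGCGTCTGT GGAATATATA CCTGAAACCG GCGGTACATA GGGTGCCGGA ATTCTGGAGC"
seq = clean_sequence(first_line)
print(len(seq))                      # 60 bases in this excerpt
print(f"GC fraction: {gc_fraction(seq):.2f}")
```

The record reports GC content as a rounded percentage, so a value computed this way from the full sequence should be compared after rounding.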
 
Protein sequence
MRLWNIYLKP AVHRVPEFWS HEKIEYIRRS SQEGIPPYAL EEAPSKIGRL LDRLVFKDLR 
PREVNELLKK ADGLDVDVGL DFFGTRLTAP IYLGDMSFGA LSGNPNIAIA KVSTEEGMVA
GIGEGGLHPE VAKYRNIVVQ WASARFGMDM ALLRAGLAVN IKIGQGAKPG IGGHLPGIKV
TKIIAELRKI PEGSEALSPA PHHDIYSIED LAQRVKALRD LTGKPVLVKV AAVNKIMYVA
VGVSRSTAEG IIIDGAGAGT GATPISVRNH LGIPIDYAIP VVDKWLRENG ARSNFLVIGG
GMLYSASDIA KLIALGADMA NIGTAALLSF GCIMCHSCHT GGCPTSLTNM IGARPDLDIE
WASAMLRRYL RALRLGLKAI LYSLGMDSLK ELAGRRDLLG LYYADEAVAS TIGVDLMAEG
EIAYFQEYAP TMPREVYEEG KVPIIGMGGV VPGYTYPARR PLDLLRIEAA QVTHPSVDPY
REEVDVRVYV NGFSYDTPIV VPGLERAAVM AGYALGALVD GAGCPEPEYC LSTLPHVRLP
PSRDIEPREG IVIIDERLKG DAWLEEAVAL LDEKARQAGI REKMTIIAAG RLSSGADVYK
LAALGADLVE PREAFELLAS RQAPSYAAKR RWYENLISVL TKELKLAMGA GGITDYYHMV
GNKDLLRSLD GRIAARLNVP VAGN