Gene BURPS668_2002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_2002 
Symbol 
ID4882590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1980094 
End bp1982796 
Gene Length2703 bp 
Protein Length900 aa 
Translation table11 
GC content73% 
IMG OID640127930 
Productargininosuccinate lyase 
Protein accessionYP_001059037 
Protein GI126439921 
COG category[E] Amino acid transport and metabolism
[I] Lipid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase
[COG0439] Biotin carboxylase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.848902 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
TTGGCGGCCA CATTCATCGT GAAGACCTTC GTATTCATCG AAAGCAACAC CACCGGCACC 
GGCCGGCTCT GCCTGCAAAA AGCGCTGCTG CGCGGCTTCG ACGTGCTGTT CGTCACGAGC
CGGCCACAGC TCTATCCGTT CCTGCAGGAA GAGATGGTCG TGCCGCTCGT CGCCGACACG
GCCGATCCGC AGCGGATCGC CGATGCGCTT GCGCCGTATG CGGGCATCGC CGGGATCTTC
TCGACGTCCG AGTACTACAT CGAAACCGCC GCGACGGTGG CCACGCGCCT GGGCTTGCCC
GCGGCGGATC CGGAGGCGAT CCGCACCTGC CGCGACAAGG GCCGGCTGCA CCGCCGCCTG
CGCGACGCGG GCGTCGGCGT GGCCGACACC GAGATCGTGT CCGAGCGCAC GCAACTGCGC
GACCTGGCGC ACGGCGCCAC GTATCCGCGC GTGCTGAAGC CGGCGTTCGG CTCCGGCAGC
GTCGGCGTGC GGCTCGTGCG GACGCCGGCC GAGATGCTCG CGCACGGCGA GCGCATGCTC
GACGCGCGCG GCAACGAGCG CGGCATCGCG CTCGCGCGGC AGGTGCTCGT GCAATCGTTC
GTCGACGGGC CGGAATTTTC GGTCGAAGTC GTCGGGCTCG GCGCGGAGCA CGGCCATGCG
GTGCTCGGCG TGACGGGCAA GCACCTCGGG CCGCTGCCGC ACTTCGTCGA AGCCGGCCAC
GATTTTCCGG CGCCGATCGC GGCCGCGCAG CGCGATGCGA TCGTGGCCGA GACGCTGCGC
GCGCTCGACG CGGTGGGCCA CCGCTTCGGG CCCGCCCATG TCGAATGCCG CGTGAGCGAC
GGCAAGGTCG TCGTGATCGA GATCAATCCG CGTCTCGCGG GCGGCATGAT CCCGCAGGCG
ATCGAATGGG CGACGGGCGT CGACGTGCTC GGCGCGATGA TCGACCTGCA CGCGGGCACG
CCGCCTGACC TGGGCCCGCG CCGCCGCGGC CACGCGGCGA TCCGCTTCGT GCTGCCCGCG
CGCAGCGGCG AGCTGAGGGC GCTGTCGTTC GAGCCCGACG AGCGCTTTGC GGGGGTGCGC
ACGCGCTTCA TGCCGCTCAA GCAGCTTGGC CAGCGCATCG AGCCGGCCGG CGACTTCCGC
GATCGTCTCG CGCTCGTCAT CGCGTCCGCG GCCGATCCGG ACGCGCTCGC GCACGCGCTC
GAGGACGTCG ATCGCTGCGT GACGGTCGCG ATCGGCGACG CCGGCGCGGC GGGCGAGGGC
GCGGGCGCCG GCCGGCTGCG CCGCACGCTG CATCCGGAGG CGCTCGCGAT CGTGCGCAAG
CCGGCGCCGC GCGCCGAGCG GCTCGCCGAA CTCGACGCGT TCGCGGCGAT CGACGAGGCG
CACCTGCTGA TGCTCGTCGA CGCGGGAATC TGCGACCGGG CGCGAGCCGC GACGGTGCTC
GCGGAACTCG CGCGGCAGCG CGACGCGAAA TTCGCCGCGA TCGCCGACGC GATCGCGCCG
CGCGGCACCT ACGCACTGTA CGAGCAACTG CTCATCGAGC GGGTCGGGAT CGACGCGGGG
GGCGCGGTGC ATACGGCGCG CTCGCGCAAC GACATCAACG CGTGCGTCGC GAAGCTGCGC
GCACGCGAGT GGTTCGACAC GTGCGGCGGC AAGCTGTGGC GCGTGCGCGC GGCGATCGTC
GACAAGGCGC AGCACACGCT CGACTGGCCG TTGCCCACGT ACAGCCAGTA CCAGGCGGCG
CAGCCCGGCA GCTTCGGCTA TTACCTGTGG TCGGTCGAGA CCGCGCTGCG GCGCGACCAG
GCGGCGCTCG AACGGCTCGA CGAGGAGCTC GCCGTCTGTC CGCTTGGCGC GGGCGCGGGC
GCGGGCACCG ATTTCCCGAT CCGCCCGGGC GTGAGCGCGG CGCTGCTCGG CTTCGCGCGC
AGCTTCGACA GCGCGCTCGA CGCGGTCGCG AGCCGCGATC TCGTGCTGCA TTTCCTGGCC
GCGATCGCGA TCGCATCGAC GACGCTCAGC CGGCTCGCGC ACGACCTGCA GCTCTGGACG
ATGCGCGAGA CCGACTTCCT CGCGCTGCCG GACGAACTGA GCGGCGGCTC GTCGCTGATG
CCGCAGAAGA AGAACCCATA CCTGCTGGAG ATCGTCAAAG GCAAGCTCGC GCACGTCGCG
GGCGCGCTGA ACGCGGCGGT GTTCGCGTCG CAGCGCACGC CGTTCAGCAA TTCGGTCGAG
ATCGGCACCG AGATGCTCGC GCCGTGCGCG GACGCCGTGC AGGCGTTCGG CGAAAGCTGC
GATCTGTTGC GGCTGATGGT GAGCGGCGTG ACGGGCGATC CGGCGAAGAT GCGCGCGGCG
GCCGAGGCGG GGCTCGTGAG CGCGACGCAG GTCGCCAACG CGCTGGTGCG GGAGACGGAC
ATCAGCTTTC ACGCCGCGCA TCGGCAGATC GGCGCGCTGA TCACGCAGGC GCTCGACGCG
CACGAGGACC CGGCCGCGGC GCTCGACGCG CTCGTGCGGC AGCCGGGCGC ATCGATCGAC
GAAGCGGCCG CGCGGCTCGC CTACGGCGGC GGGCCCGGCG CGGCGGGCGC GGGGCTCGCG
CGCTCGCGCG CGCTGCTGCG GCAGTCGGCC GAACGCCTGT GGCGGCGCCG CGCCGCGTGG
CACGCGGCGC ACGCGCGGCG GCGCGGGTGC GTCGCCGATC TGCTCGCGGC GGCGGCGGCC
TGA
 
Protein sequence
MAATFIVKTF VFIESNTTGT GRLCLQKALL RGFDVLFVTS RPQLYPFLQE EMVVPLVADT 
ADPQRIADAL APYAGIAGIF STSEYYIETA ATVATRLGLP AADPEAIRTC RDKGRLHRRL
RDAGVGVADT EIVSERTQLR DLAHGATYPR VLKPAFGSGS VGVRLVRTPA EMLAHGERML
DARGNERGIA LARQVLVQSF VDGPEFSVEV VGLGAEHGHA VLGVTGKHLG PLPHFVEAGH
DFPAPIAAAQ RDAIVAETLR ALDAVGHRFG PAHVECRVSD GKVVVIEINP RLAGGMIPQA
IEWATGVDVL GAMIDLHAGT PPDLGPRRRG HAAIRFVLPA RSGELRALSF EPDERFAGVR
TRFMPLKQLG QRIEPAGDFR DRLALVIASA ADPDALAHAL EDVDRCVTVA IGDAGAAGEG
AGAGRLRRTL HPEALAIVRK PAPRAERLAE LDAFAAIDEA HLLMLVDAGI CDRARAATVL
AELARQRDAK FAAIADAIAP RGTYALYEQL LIERVGIDAG GAVHTARSRN DINACVAKLR
AREWFDTCGG KLWRVRAAIV DKAQHTLDWP LPTYSQYQAA QPGSFGYYLW SVETALRRDQ
AALERLDEEL AVCPLGAGAG AGTDFPIRPG VSAALLGFAR SFDSALDAVA SRDLVLHFLA
AIAIASTTLS RLAHDLQLWT MRETDFLALP DELSGGSSLM PQKKNPYLLE IVKGKLAHVA
GALNAAVFAS QRTPFSNSVE IGTEMLAPCA DAVQAFGESC DLLRLMVSGV TGDPAKMRAA
AEAGLVSATQ VANALVRETD ISFHAAHRQI GALITQALDA HEDPAAALDA LVRQPGASID
EAAARLAYGG GPGAAGAGLA RSRALLRQSA ERLWRRRAAW HAAHARRRGC VADLLAAAAA