Gene BURPS668_2507 details


Gene Information

Locus tag: BURPS668_2507
Symbol:
ID: 4884988
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 2467656
End bp: 2470394
Gene Length: 2739 bp
Protein Length: 912 aa
Translation table: 11
GC content: 71%
IMG OID: 640128435
Product: argininosuccinate lyase
Protein accession: YP_001059534
Protein GI: 126438665
COG category: [E] Amino acid transport and metabolism; [I] Lipid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase; [COG0439] Biotin carboxylase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase; [TIGR02019] bacteriochlorophyll 4-vinyl reductase
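
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the coordinates are 1-based and inclusive, and that the final codon is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
# Values copied from the record; the coordinate convention
# (1-based, inclusive) is an assumption.
START_BP = 2467656
END_BP = 2470394


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature with 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 2739 912
```

Both results agree with the Gene Length and Protein Length fields in the record.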


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGGGCA TCTTTGTCTT CATCGAGAGC AACACGACGG GCACGGGCGA GCTCCTCGTG 
CGCAAGGCGC TGCAACGCGG CCTCACGCCT TACTTTCTCA CCGCGAACCG CGGCAAGTAT
CCGTTTCTCG ACGCGATCCG GGTCGTGACG ATCTCGCTCG ACACGAGCGA CGCCGACCGG
ATCCACCGCT TCGTTTCGTC GCTCGACGGC GTGGCGGGCG TGATGTCGTC GTCCGAGTAT
TTCATCGAAG TCGCGAGCGA AGTCGCGCGG CGGCTCGGGT TGCCGACCGC GAACACCGAA
GCGACGCGTG TGTGCCGCGA CAAGAAGCGT CTCGCGCGGA CGCTCGCCGA GCACGGGATA
GACGTGCCGC GCACGCACGC GCTTGCGCTC GACGCCGACG CCGACGCCGA TGCCGATGCC
GATGCCGATG CCGACGCCGT CGCGCTGTCC GCGCTCGACG GGCTCGCCTA TCCGGTTGTC
GTCAAGCCGA GGATGGGCTC CGGCAGCGTC GGCGTGCGAC TGTGCGCGAG CGTCGACGAG
GCGGCCGAGC ACTGCGCGGC GCTGCGGCGC GCGGGCACGC GCGCGGCGCT CGTGCAGGCC
TATGTCGAAG GCGACGAGTA TTCGGTCGAG ACACTGACGG TCGCGCGCAG CACGCAGATC
GTCGGCATCG TCAGAAAACG CCTCGGGCGC GAGCCGCACT TCGTCGAGAT CGGTCATGAC
TATCCGGCGC CGTTGTCGAG CCCGCAGCGC GAGCGCATCG AGCGCACGGT GCTGCGCGCG
CTCGAGGCGC TCGGCTACGC GTTCGGGCCC GCGCACACCG AGCTGCGCGT GCGCGGCGAC
ACGGTCACGA TCATCGAGAT CAATCCGCGC CTCGCGGGCG GTCTGATTCC GGTGCTGCTC
GGCGAGGTAT TCGACGTCGA CCTGCTCGAC CACGTGCTCG ACATGTGGCT CGGCGTGGCG
GCGTTTGCCG ATCTCACCGC GAAACGCTAC GGCGCGATCC GCTTCGCGCT GCCGGCGCGC
GAGGGCGTGC TGCGCGGCCC GCTCGCGCTG CCGGCCGACA TCGCCGCGCG GCCCGAGCTC
CGGCATTTCC ATCCGATCGC GCAGCCGGGC GACGCGCTGC GGCTCGAAGG CAGCTTCCGC
GACCGCATCG CCGCCGTCGT CTGCGCGGGC GATCATCGCG AATCGGTCGA GGCGCTCGCC
GAGCGTGCGG TGGCCGGGCT GAGCATCGAC ATCGGCGACG ACGCGCGCGC AGCCGCGTTA
AACGAGTCGA ACGAGTCGAA CGGCGCGAAC GGCGCGAACG GCGCGAACGC GGCGACGCCC
GGCCTGCCGC CTCGCCTGCA GGCGATCGTC TACGGCGACG GCGCGAGCGA GGCCCCACTC
GCGGAACTCG ACCATCTGTT CGATCTGAAC GAGGCGCATC TCGTCATGCT CGGGGCGACG
CGCATCGTCG CGCCCGAACG GGTCCGGCCG CTGCTCGACG CCCATCGCCG GCTGCGGCGC
GCGGGCTACG CGCCGCTGCT CGCGCGGCCG AGGCCGCGGG GCCTGTACAT GCTCGTCGAG
GCATACCTGA TCGAGACGCT CGGCGAGGAT GTCGGCGGCG TGCTGCAGAC GGGCCGCTCG
CGCAACGACA TCAACGCGGC GACGACGAAG CTGCATTTGC GCGACGCGAC GTCGCGCGCG
TTCGACGCGC TATGGCGCTT GCGGCGCAGC CTCGTCTTCA AGGCGTCGGC GAACGTCGAC
TGCGCATTTC CGATCTACAG CCAGTACCAG CCGGCGCTGC CGGGCACGCT CGCGCATCAG
CTGCTCGCGT TCGACGGCGC GCTCGCGCAC GAAACCCATG CGCTGTTCGC GTTGTTCCAG
CACATCGATG TCTGCCCGCT CGGCGCCGGC GCGGGCGGTG GCACGACGCT GCCGATCGAT
CCGGAGTTCG TCTGCCGGCT GCTCGGCTTC GAGCAGCCGG CGCCGAACAG CCTCGATGCG
GTGGCGAACC GCAGCGGCGT CGTCCATTTC CTGTCGGCGA TGAACGCGAT CGGCCTCGTG
CTGTCGCGTC TCGCGCAGGA CCTGCAGATC TGGACGACGG CGGAGTTCGC GCTCGTGTCG
CTGCCCGCCG CGCTGACGGG CGGCTCGTCG ATGCTGCCGC AAAAGAAAAA CCCGTTTCTC
GTCGAATTCG TGAAGAGCCG CGCGGGCGTG CCGTTCGGCG CGCTCGCGAG TTGCTCGGCG
GCGCTCGGCA AGACGCCGTA CACCAATTCG TTCGAGGCGG GCTCGCCGAT GAACGGGCTG
ATCGCGCAGG CGTGCGCGGC GATCGAGGAC GCGGCGGCGG TCGCCGTGCT GCTGATCGAC
GGGCTCGAAG CGGCGCAGGC ACGCATCGAC GCCCATCTGA GGGACACGGG CGTGGTCGCG
ATGGCGGTGG CCGAATCGCT CGCCGTTCGC CGGTCGATCG ATTTTCGCTC CGCGCACACG
CGGGTTGCGC AGGCGGTGCG GGACAGCGCC GCGCAGGGGC GCTCGAGCCA CGATGCGCTC
GCCGCGCTCG ACCCCGATTT CGTCTCGCGC GCGCCGCTGG AGTGGGCGCG CAGCCACCGT
TTCGGCGGCG GCCCGGGCGC GGCCGACCTG AACCATGGCG TCGCGCGCGC GTGCCGTGCG
CTCGCCGACG ACGAGGCCGT GTTTCGCCGC AAGCAGGACG TGTGGCGGGA GGCGGAACAG
ATGCGGCGGC TCGCGGCGCA GCAACTGGCG GGCGATTGA
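
The gene sequence is laid out in space-separated blocks of 10 bases, so it has to be cleaned before any computation such as the GC content field. A minimal sketch, assuming plain A/C/G/T input; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first row of the sequence is used here for brevity.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First row of the gene sequence, as printed in the record.
first_row = "ATGACGGGCA TCTTTGTCTT CATCGAGAGC AACACGACGG GCACGGGCGA GCTCCTCGTG"
seq = clean_sequence(first_row)
print(len(seq))                   # 60
print(round(gc_content(seq), 1))  # GC percentage of this row only
```

Passing the full sequence through the same two functions gives the value to compare against the GC content field.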
 
Protein sequence
MTGIFVFIES NTTGTGELLV RKALQRGLTP YFLTANRGKY PFLDAIRVVT ISLDTSDADR 
IHRFVSSLDG VAGVMSSSEY FIEVASEVAR RLGLPTANTE ATRVCRDKKR LARTLAEHGI
DVPRTHALAL DADADADADA DADADAVALS ALDGLAYPVV VKPRMGSGSV GVRLCASVDE
AAEHCAALRR AGTRAALVQA YVEGDEYSVE TLTVARSTQI VGIVRKRLGR EPHFVEIGHD
YPAPLSSPQR ERIERTVLRA LEALGYAFGP AHTELRVRGD TVTIIEINPR LAGGLIPVLL
GEVFDVDLLD HVLDMWLGVA AFADLTAKRY GAIRFALPAR EGVLRGPLAL PADIAARPEL
RHFHPIAQPG DALRLEGSFR DRIAAVVCAG DHRESVEALA ERAVAGLSID IGDDARAAAL
NESNESNGAN GANGANAATP GLPPRLQAIV YGDGASEAPL AELDHLFDLN EAHLVMLGAT
RIVAPERVRP LLDAHRRLRR AGYAPLLARP RPRGLYMLVE AYLIETLGED VGGVLQTGRS
RNDINAATTK LHLRDATSRA FDALWRLRRS LVFKASANVD CAFPIYSQYQ PALPGTLAHQ
LLAFDGALAH ETHALFALFQ HIDVCPLGAG AGGGTTLPID PEFVCRLLGF EQPAPNSLDA
VANRSGVVHF LSAMNAIGLV LSRLAQDLQI WTTAEFALVS LPAALTGGSS MLPQKKNPFL
VEFVKSRAGV PFGALASCSA ALGKTPYTNS FEAGSPMNGL IAQACAAIED AAAVAVLLID
GLEAAQARID AHLRDTGVVA MAVAESLAVR RSIDFRSAHT RVAQAVRDSA AQGRSSHDAL
AALDPDFVSR APLEWARSHR FGGGPGAADL NHGVARACRA LADDEAVFRR KQDVWREAEQ
MRRLAAQQLA GD
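
The protein sequence can be reproduced from the gene sequence by codon translation. The sketch below is an illustration, not the database's own pipeline. It relies on translation table 11 assigning the same amino acids to codons as the standard code; table 11 differs in its alternative start codons, which are not handled here because this gene starts with ATG. `translate` and `CODON_TABLE` are hypothetical names.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order ('*' marks a stop codon).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(text: str) -> str:
    """Translate a block-formatted, in-frame DNA sequence up to the first stop."""
    seq = "".join(text.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases and the final row of the gene sequence from the record.
print(translate("ATGACGGGCA TCTTTGTCTT CATCGAGAGC"))            # MTGIFVFIES
print(translate("ATGCGGCGGC TCGCGGCGCA GCAACTGGCG GGCGATTGA"))  # MRRLAAQQLAGD
```

The two fragments match the first block and the final 12 residues of the protein sequence. The final row can be translated on its own because the 45 preceding rows hold 60 bases each, so it begins in frame.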