Gene Glov_0163 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGlov_0163 
Symbol 
ID6367734 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter lovleyi SZ 
KingdomBacteria 
Replicon accessionNC_010814 
Strand
Start bp153626 
End bp155389 
Gene Length1764 bp 
Protein Length587 aa 
Translation table11 
GC content60% 
IMG OID642675560 
Productpara-aminobenzoate synthase, subunit I 
Protein accessionYP_001950417 
Protein GI189423240 
COG category[E] Amino acid transport and metabolism
[H] Coenzyme transport and metabolism 
COG ID[COG0115] Branched-chain amino acid aminotransferase/4-amino-4-deoxychorismate lyase
[COG0147] Anthranilate/para-aminobenzoate synthases component I 
TIGRFAM ID[TIGR00553] aminodeoxychorismate synthase, component I, bacterial clade 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCTGACA CCTGCCCTAC CATACTGCTG GACTCCTGCA GCGCAGACCG TTTCAGCGCA 
TCCTGGCGCT TTGACGGCCA CATCCGTACC CTGATTGCCG AGACCTCTGA TCAGGTGCAA
TCAGTACTTG AGCAGGCCGA GGCAGCCACC CGGCAGGGGC TGTATGCCGT GGGATTTGTG
GTCTATGAGG CCGCCCGGGC ATTAAACCCG CACCTGCCGT CGCTGCCACC CCGCGCCGGC
CTGCCGCTGG CCTGGTTCAG CCTGTTCAGG GAGCGCCATT GCGTAACAGC GGGGGATGGC
CTGCCGGACC ATACGACTGC CACACCGGAA CTCCAGCCTG CCTGCAGTCC GGCCGACTAC
GGGATTGCCA TCAGCCGGAT TCACACGGCG ATTGAGCAGG GTGAGACCTA TCAGATCAAC
CACACCTTTC CACTGCAGGG GCAGTGGCAG GGTGATCCGC GACAGCTCTA TCGCAGCCTG
TTACTGGCCC AGCAACCGGC CTTTGGCGCC TTTCTGGATA TCGGCAGCCA CACGATCATC
TCTGCTTCAC CAGAGCTGTT CTTCAATATC AAGGATGGTC TGATCACCAC CAGGCCGATG
AAAGGGACCG CCCCCCGCGG ACGCTTTCCT GCTGAAGACC GGGCCCTGCA AGAACAGCTG
CAGCAGGATA TGAAGGAGCA GGCCGAGAAC CTGATGATCG TTGACCTGCT GCGCAACGAC
CTGGGGCAGG TGGCCCGGAC CGGCACGGTG CAGACCGAGA GGTTGTTTGA GGTAGAGACC
TATCCCACGG TGCATCAGAT GACCTCCACC ATTACCGCAC AACTGAAGCA GGATATCGGC
CTGCTGGAGC TGTTCAGGGC CCTGTTCCCC TGCGGGTCGG TCACCGGTGC TCCCAAGCGC
CGCAGCATGG AGTTGATTGC TGAAATCGAA GGTCAACCGC GCGGGATCTA CTGCGGTACC
ATCGGCTATC TGGCTCCCGG GGGTGAAATG GCCTTTTCAG TTGCCATCCG CACCTTGGTG
CTAAACAAAC AGACCGGCCG GATCAGCCTG GGGGTGGGCA GTGGGATTAC CTGGGATGCC
CGACCCGATG CCGAGTATGT TGAATGCCTG CACAAGGCCG CCTTTCTCAA ACCGCGTCCG
CAACCCAGAC TGCTGGAATC ACTGCTGTTG GAAGACGGCA ACTATCCCCG CCTGGAGCAG
CACCTTGAAC GGCTCGGCTG GTCTGCGGCC CGGCTGGGCT ATTGTTGTGA CCGGGAACAG
ATCAGACAGG CGTTGCTGGC CCATGCCGCC GGCACAACCG GTCAGCACAA GACCCGGCTG
CTGCTGGCAC AGGATAGTAC CTTTCAGATT GAATCAGCCC TGTTACTACA GATCCAGCAG
CCGCTGAAGC TTGCTCTGGC CACAACATTT GTAGACCCAA CTGACCTGCT GCTGTACCTC
AAAACCGAAC AACGCCAGCG CTACGAACAG GCCCGTCAGG AACAGCCAGA GGCGGATGAG
GTGTTGCTCT GCAACAATCG GGGTGAACTG ACTGAGGGTA GTTTCACCAA TCTGGTGCTG
AAGCTGGATG GTCGGCTGGT AACCCCGCCG CTGGCCAGTG GTCTGCTGCC GGGGGTGATG
CGTCAGCAAC TGCTGGAACA GGGAACCATA GAAGAGCAGG TGTTATACCC GCAGGATCTG
CAGCGGGCTG AAGAGATCTG GCTGATCAAC AGCGTACGGG GCTGGCTGCG GGCAGAGCTG
ATTAAAGGAG CAAGAACGTG CTAA
 
Protein sequence
MPDTCPTILL DSCSADRFSA SWRFDGHIRT LIAETSDQVQ SVLEQAEAAT RQGLYAVGFV 
VYEAARALNP HLPSLPPRAG LPLAWFSLFR ERHCVTAGDG LPDHTTATPE LQPACSPADY
GIAISRIHTA IEQGETYQIN HTFPLQGQWQ GDPRQLYRSL LLAQQPAFGA FLDIGSHTII
SASPELFFNI KDGLITTRPM KGTAPRGRFP AEDRALQEQL QQDMKEQAEN LMIVDLLRND
LGQVARTGTV QTERLFEVET YPTVHQMTST ITAQLKQDIG LLELFRALFP CGSVTGAPKR
RSMELIAEIE GQPRGIYCGT IGYLAPGGEM AFSVAIRTLV LNKQTGRISL GVGSGITWDA
RPDAEYVECL HKAAFLKPRP QPRLLESLLL EDGNYPRLEQ HLERLGWSAA RLGYCCDREQ
IRQALLAHAA GTTGQHKTRL LLAQDSTFQI ESALLLQIQQ PLKLALATTF VDPTDLLLYL
KTEQRQRYEQ ARQEQPEADE VLLCNNRGEL TEGSFTNLVL KLDGRLVTPP LASGLLPGVM
RQQLLEQGTI EEQVLYPQDL QRAEEIWLIN SVRGWLRAEL IKGARTC