Gene Franean1_0619 details

Gene Information

Locus tag: Franean1_0619
Symbol:
ID: 5669036
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 720249
End bp: 721457
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 67%
IMG OID: 641239546
Product: amidohydrolase 2
Protein accession: YP_001504984
Protein GI: 158312476
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
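
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the gene length includes the terminal stop codon; both are assumptions that happen to match the listed numbers, not statements from the record. The function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon (assumed to be included).
    return gene_len_bp // 3 - 1


length = gene_length_bp(720249, 721457)
print(length, protein_length_aa(length))  # prints: 1209 402
```

Both values agree with the Gene Length and Protein Length fields of this record.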


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.6071
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACGTCAT CACAGTCGCT GGACTGGCTG ATCTCGGTCG ACGACCACGT CCTGGAGCCG 
CCGAACCTGT GGACCGACCG GCTCCCGGCC AAGGACCACG ACCGGGCTCC CCACATGGTG
ATCGACGACA CGGGAATGGA CTGCTGGGTC TACGACGGCA AGCGTTTCCC GAGCTCCGGG
CTGAGCGCCG TCGCCGGGAA GGAGAAGGAG GAGTTCAGCC CCGAGCCCCT CTCCTATGCC
GACATGCGGC CCGGTTGCTA CGACCCGCAG GCCCGCCTGG AGGACATGAA CCGGGCCGGC
ATCCTGGCCT CGCTGTGCTT CCCGACGGTG ACCCGGTTCT GCGGGCAGAT GTTCTCCGAG
GCGAGCGACC GCGAGTTCGG CCTGGTGTGC CTGAAGATCT ACAACGACTG GATGATCGAG
GAGTGGTGCG GCAGCGCTCC CGGCCGCTAC ATCCCGCTCA CCCTCATCCC GCTGTGGGAC
CCGCAGCTCG CGGTGAAGGA GCTCGAGCGC TGCGCGGCGA AGGGGTCCAC CACCTTCGCC
TTCTCGGAGA ACCCGGCCCC GCTGGGCCTG CCGACCATCC ACGACCGCGA CGGGTACTGG
GAGCCGGTGA TGGCTGCCGC GAACGACCTG GAGATGGTCG CGTCGATGCA CGTCGGCTCC
TCGTCGCAGG TGCCGAAGAT CGCTCCCGAC GCGCCGTTCA TGGCGAACCT GACCTGGGGC
GCGATGCGTA CCTCGGGCGC CATGCTCTCC TGGCTGTTCA GCGGGATGTT CCAGCGGTAC
CCGAAGCTGA AGATCGCGCT CTCGGAGGGC GAGATCGGCT GGATGCCGTA CTACCTGGAG
CGCGCCGAGC AGGTGATCGA CAAGCAGCGC CACTGGGTCA AGCGTGGTGT CCGCTTCAAC
GAGCACGCCG GGGCGGACGC CCTCGACCTG GACACCCTCG ACATCCGCGC CAGCTTCCGT
GAGCACGTCT TCGGTTGCTT CATCGACGAC GCGCACGGCA TCGCCAGCAT CGACGAGATC
GGCGAGGACA ACATCATGTG CGAGACGGAC TACCCGCACT CCGACTCGAC CTGGCCGAAC
TGCATCGACG TCGTCAGGAA CCGGATCGGC CACCTGTCGG AGGAGGTCCA GTACAAGATC
CTGCGCGGCA ACGCCGAGCG GCTGTACCGG TTCACCCCGG CCGAGCCGCC TGTGCTCGCG
AAGGCCTGA
 
Protein sequence
MTSSQSLDWL ISVDDHVLEP PNLWTDRLPA KDHDRAPHMV IDDTGMDCWV YDGKRFPSSG 
LSAVAGKEKE EFSPEPLSYA DMRPGCYDPQ ARLEDMNRAG ILASLCFPTV TRFCGQMFSE
ASDREFGLVC LKIYNDWMIE EWCGSAPGRY IPLTLIPLWD PQLAVKELER CAAKGSTTFA
FSENPAPLGL PTIHDRDGYW EPVMAAANDL EMVASMHVGS SSQVPKIAPD APFMANLTWG
AMRTSGAMLS WLFSGMFQRY PKLKIALSEG EIGWMPYYLE RAEQVIDKQR HWVKRGVRFN
EHAGADALDL DTLDIRASFR EHVFGCFIDD AHGIASIDEI GEDNIMCETD YPHSDSTWPN
CIDVVRNRIG HLSEEVQYKI LRGNAERLYR FTPAEPPVLA KA
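
The relationship between the two sequence blocks can be reproduced with a minimal sketch. It strips the 10-character grouping, translates codon by codon, and computes the G+C fraction. The codon table is built from the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in which codons may act as starts); the helper names `clean_sequence`, `gc_fraction`, and `translate` are hypothetical and not part of any database tool.

```python
BASES = "TCAG"
# Amino-acid assignments in TCAG order; translation table 11 uses the
# same assignments as the standard genetic code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used to group the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    return sum(base in "GC" for base in dna) / len(dna)


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean_sequence(dna)
    residues = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        residues.append(amino_acid)
    return "".join(residues)


# First row of the gene sequence as listed above.
first_row = "ATGACGTCAT CACAGTCGCT GGACTGGCTG ATCTCGGTCG ACGACCACGT CCTGGAGCCG"
print(translate(first_row))  # prints: MTSSQSLDWLISVDDHVLEP
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.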