Gene Franean1_4173 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4173 
Symbol 
ID5672528 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4960385 
End bp4962076 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content77% 
IMG OID641243046 
Productamidohydrolase 3 
Protein accessionYP_001508463 
Protein GI158315955 
COG category[R] General function prediction only 
COG ID[COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.000539132 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.473952 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCACACTT GCCCACCGCG GACCACAGGT GAGCGCCGCC CACCCGGCAC AGTAGGGACC 
GTGAGCCCCC ACCACCTGCC GGCAGCCCGG CGCGTCCTCT ACCACCACGG ACACGTCCAC
AGCCCCACCC ACCGGCACGC CACCGCCCTG CTCACCGACG GCCCCACCAT CACCTGGATC
GGCACCGACG ACCAGCCCGA CGCCAGCGGG CCGCTGCCAG CCGGCCCGGT CGACCACACC
GTCGACCTGC GCGGCGCCAC CCTCACCGCC GCCTTCGTCG ACGCCCACCT GCACACCACC
GCCACCGGGC TCGCCCTCGA CGGCGTCGAC CTCACCGACA GCCCGTCCCT GCGCCACACC
CTCGACCAAC TCACCCGCGC CGCCCACCAC CGGCCCGGCC AGCCACTGCT CGGCACCGGC
TGGGACGAAA CCCGCTGGCC CGAACAGCGA CCCCCCACCA GCGGGGAACT CGCCCGCGCC
GCCGGGCCCG TCGACGTCTA CCTCGCCCGC GCCGACGGCC ACACCGCCGT CATCTCCCCG
CACCTGGCCA CCCGCAGCGG CGCCCACCAC GCCACCGGCT GGCTCGGCGA CGGCCTGTGC
CGCGACGACG CCCACCACCT CGCCCGCACC GCCGCCTACC ACGACCTGCC CCCCACCACC
CGCCGCGCCG CCGCCCGCCG CGTCCGCGCC CACGCCGCCA CCCTCGGCAT CGCCGCCCTG
CACGAGATGG CCGGCCCGCA GGTCTCCTCC GCCGACGACC TCGCCGCCCT GCTCACCCTC
GCCCGCGACG AACCCGGCCC CACCATCACC GGCTACTGGG CCGGTGAACT CACCGTCGCC
ACCGCCCTGA ACACCGAACC CGGCCTGGGC CCCGTCGGCT ACGGCGGCGA CCTGTTCGTC
GACGGCTCCC TGGGCTCACA CACCGCCGCG CTACGCAGCC CCTACACCGA CCAGCCCACC
CACCGCGGGC AGCTGCACCG CGACGCCGAC GACGTCCGCG ACACCGTCCT CGACGCGGTC
GCCGCCGGCC TGCAGACCGG CTTCCACGCC ATCGGCGACG CCGCCCTCGA CACCGTGCTC
GACGGGGTGC GCGCCGCCAC CGCCCGCGTC GGCACCGCCA CGATCAGCGC CGGCACCCAC
CGGGTCGAAC ACGCCGAGCT GCTGCACCCC GAACAGATCA TCGCCATGGC GCGCCTGGGG
CTCGTCGCCT CCGTGCAGCC CGCGTTCGAC GCCCGCTACG GCGGCCCCGA CGGCCTGTAC
ACCCGCCGGC TCGGCGCCGA CCGTGCCAGC GCGATGAACC CGTTCGCCGC GCTGCACCGC
GCCGGGGTCG TACTCGCCCT GTCCTCCGAC AGTCCCGTCA CCCCCCTCGA CCCGTGGGGA
GCGGTACGCG CCGCGGCCAC CCATCACACC CCGTCCGCGC GGATCAGCGG TGCCGCGGCG
TTCACCGCCG CCACCCGCGG CGGCTGGCTG GCCGCCCGCG CCGGCGGTGA CGGTGCTGGA
CGGATCACCG TCGGCGCGCC CGCGACCTTC GCGATCTGGG AGACCCCCCA CCCGCCGCGG
CCGGCCAGGC CGCCAGCCGC CCAGCCCGCC GGGCCGCTGG ATGTTCTCCT TGACCAGCTT
GACCGCACCG GCAGCGCGCC ACGCTGCCTG CGCACCGTGC TGCGCGGACA GACCCTGCAC
GACCTGCTCT GA
 
Protein sequence
MHTCPPRTTG ERRPPGTVGT VSPHHLPAAR RVLYHHGHVH SPTHRHATAL LTDGPTITWI 
GTDDQPDASG PLPAGPVDHT VDLRGATLTA AFVDAHLHTT ATGLALDGVD LTDSPSLRHT
LDQLTRAAHH RPGQPLLGTG WDETRWPEQR PPTSGELARA AGPVDVYLAR ADGHTAVISP
HLATRSGAHH ATGWLGDGLC RDDAHHLART AAYHDLPPTT RRAAARRVRA HAATLGIAAL
HEMAGPQVSS ADDLAALLTL ARDEPGPTIT GYWAGELTVA TALNTEPGLG PVGYGGDLFV
DGSLGSHTAA LRSPYTDQPT HRGQLHRDAD DVRDTVLDAV AAGLQTGFHA IGDAALDTVL
DGVRAATARV GTATISAGTH RVEHAELLHP EQIIAMARLG LVASVQPAFD ARYGGPDGLY
TRRLGADRAS AMNPFAALHR AGVVLALSSD SPVTPLDPWG AVRAAATHHT PSARISGAAA
FTAATRGGWL AARAGGDGAG RITVGAPATF AIWETPHPPR PARPPAAQPA GPLDVLLDQL
DRTGSAPRCL RTVLRGQTLH DLL