Gene Francci3_2992 details

Gene Information

Locus tag: Francci3_2992
Symbol:
ID: 3905489
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3542899
End bp: 3544368
Gene Length: 1470 bp
Protein Length: 489 aa
Translation table: 11
GC content: 74%
IMG OID: 637880312
Product: amidohydrolase
Protein accession: YP_482078
Protein GI: 86741678
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
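
The Start bp, End bp, Gene Length and Protein Length values above can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a gene length that includes one terminal stop codon. The record does not state either convention, and the function names are illustrative.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3542899, 3544368)
print(length)                  # 1470
print(protein_length(length))  # 489
```

Under these assumptions both results agree with the listed Gene Length and Protein Length.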


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.278958
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCGATC CCCCGGCATC ATCGGAACAA TCCCTGACTC CGGAGCAGCC TCCGGCGCAC 
GGGCGATCCA CCGCGACGGA ACAGTCCGAG TCCGGGCGTC CCGAGTCCGG GCGGTCGACG
GTGCCCGGAC GATCAGAGGC GGCCGAGGCC TTCGTCCTGC GGGGGGTGCA CGTCCTCGAC
AGGACCGGGC GCTTCGGTGA GCCGACCGAC GTCGCCGTCG GCGGCGGGAT CATCCATGCG
GTCGGCACGC GACTCGGTCT TGACCGGGAC GCCGTCGACG TTGACGCCCA CGACCTGTGG
CTGATGCCAG GGGTCGTCGA CTGCCACGCC CACGTCACGC AGTCCACCTA TGATCCGTTC
GAGCTGGCGA CGGCGTCGCT GTCCACCCGG GTCCTGCAGA CCGCCGCCGC GTTGCGGGCG
TCGTTGGCGG CCGGTGTCAC ACACCTGCGG GACGCCGGCG GCGCCGACGC CGGCATCCGG
GACGCGGTCA GCGCGGGCAC CGTGCCCGGA CCTCGGCTTG CCGTGTCCGT CGTCGGGCTG
AGCCGGTCCG GGCGGCACGG TGACGGGGCG ATCATCGGTC CCGGGCTGGA GTCGGCCGAA
GATCTTCTCA TGCCGGACTA TCCCGGCCGG CCCCCGCATA CCGTTACCGA GGCGGGCAGC
CTGCCGGCCT CGGTGCGGGG CATTCTGCGG GCGGGTGCGG ACTGGATCAT GATCTACGCG
AGCGGCGGGG TGATGTCCGC CCGGCCGGGG CAGCCCGAGC CGCAGTTCAG CCCGGTGGAG
CTTGCGGCCG CCGTCGCGGA GGCCCGGCGG TACGGACGTC CGGTGATGAT GCACGCCTTG
GGCGAGCGGT CGATCGAGGC CGCGGTCGCA GCCGGAGCGC GTTCCATCGA ACACGGCATC
GGGCTCACCG AGCCCGTGGC GGCCGCCATG GCGGCCGCCG GGGTGACGCT CGTCCCCACC
CTGTCGCCCT ACCAGGACCT GGCGGCGCTG GCGGCCACCG GAGTGCTGCC GGGCTGGGCG
GCGGACCGAG CCGAGGCCAC CGAGGCCGCG CTGGCCGGCA CGATCGCGGT CGCCCGTGCG
GCGGGGGTGC CGATCGCGCT CGGCAGCGAC GCCCGGCACC GCACCCGGCA CGGCGCGAAC
CTGGCCGAGA TCAGCCGGCT GCGCCATGCC GGGCTCACCC CACCCGAGGC GCTGCTCGCC
GCGACCGCGA CCGGAGCCCG GCTCTTCGGA CTGGGTGAGG GAGCCGGCCG CATCGCCGTC
GGCTCCGCCT TCGACGCGAT CCTGCTCGAC GCGGACCCCG GGGACCTGTC GATCTTCGAG
CGGCCGGGCG CTGTGAGCGG AGTGTTCCTT GGCGGTCGCG CGGTGCTGCC CCACCCGCGG
CTGCCCGGGA AGCTGGTCCG CCCCACCATG GTGATGGAAG AGGTGCCGAG AATCGTTCCC
GAGCGGCCCG CGGTGTCGCC GCCCGGCTGA
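
The gene sequence is printed in blocks of ten bases, so the whitespace has to be removed before computing anything on it. A minimal sketch of recomputing length and GC content from such text follows. The helper names are illustrative, and only the first printed line is used as a demonstration, so its GC percentage is not expected to equal the whole-gene value in the record.

```python
def clean_sequence(text: str) -> str:
    """Join a sequence printed in whitespace-separated blocks into one uppercase string."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence above, used only as a small demonstration.
first_line = "ATGACCGATC CCCCGGCATC ATCGGAACAA TCCCTGACTC CGGAGCAGCC TCCGGCGCAC"
seq = clean_sequence(first_line)
print(len(seq))  # 60
print(round(gc_percent(seq), 1))
```

Passing the full gene sequence text to the same functions gives the figures to compare with the listed Gene Length and GC content.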
 
Protein sequence
MTDPPASSEQ SLTPEQPPAH GRSTATEQSE SGRPESGRST VPGRSEAAEA FVLRGVHVLD 
RTGRFGEPTD VAVGGGIIHA VGTRLGLDRD AVDVDAHDLW LMPGVVDCHA HVTQSTYDPF
ELATASLSTR VLQTAAALRA SLAAGVTHLR DAGGADAGIR DAVSAGTVPG PRLAVSVVGL
SRSGRHGDGA IIGPGLESAE DLLMPDYPGR PPHTVTEAGS LPASVRGILR AGADWIMIYA
SGGVMSARPG QPEPQFSPVE LAAAVAEARR YGRPVMMHAL GERSIEAAVA AGARSIEHGI
GLTEPVAAAM AAAGVTLVPT LSPYQDLAAL AATGVLPGWA ADRAEATEAA LAGTIAVARA
AGVPIALGSD ARHRTRHGAN LAEISRLRHA GLTPPEALLA ATATGARLFG LGEGAGRIAV
GSAFDAILLD ADPGDLSIFE RPGAVSGVFL GGRAVLPHPR LPGKLVRPTM VMEEVPRIVP
ERPAVSPPG
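
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds the codon table from the usual TCAG ordering. Translation table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may act as starts. As a simplification, this sketch does not recode alternative start codons such as GTG or TTG to M, which does not matter here because the gene begins with ATG. The names are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons in TTT, TTC, TTA, TTG, TCT, ... order ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases and last 30 bases of the gene sequence above.
print(translate("ATGACCGATC CCCCGGCATC ATCGGAACAA"))  # MTDPPASSEQ
print(translate("GAGCGGCCCG CGGTGTCGCC GCCCGGCTGA"))  # ERPAVSPPG
```

Both fragments match the start and end of the protein sequence listed above.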