Gene Franean1_1408 details


Gene Information

Locus tag: Franean1_1408
Symbol:
ID: 5669814
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1704654
End bp: 1705862
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 70%
IMG OID: 641240331
Product: amidohydrolase
Protein accession: YP_001505758
Protein GI: 158313250
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG1228] Imidazolonepropionase and related amidohydrolases
TIGRFAM ID:
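
The gene and protein lengths listed above follow from the start and end coordinates. The short Python sketch below reproduces that arithmetic, assuming the coordinates are 1-based and inclusive and that the final codon is a stop codon; the variable names are illustrative and are not part of the record.

```python
# Coordinates copied from the record (assumed 1-based, inclusive).
start_bp = 1704654
end_bp = 1705862

# An inclusive span contains end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# Each codon is 3 bases; the last codon is assumed to be a stop codon
# and therefore does not contribute an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")      # 1209 bp
print(protein_length, "aa")   # 402 aa
```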


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.321357
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGACATCG ACGCCGGTGT CGTGCGGTCG CCCGCGGTGA TCGTGGTGGA GGGTGGTCGC 
ATCACCGCGA TCGGCCCCAC CGAGGTCCCC GCGAACGCGA CCGAGTTCGA CCTGGGCGAT
GTCACGCTCC TGCCTGGGCT CATGGACATG GAGCTGAACC TGCTCATCGG CGGTCCGGGC
GGCCCGGACG GGCTACCCAA CCCGATGCAT GGCGTCCAGG ACGACCCGGT GTACCGGACC
CTGCGTGGCG CGGTGAACGC CCGCGCCACG CTCCAGGCCG GCTTCACCAC CGTGCGCAAC
CTGGGGCTGA TGGTCAAGAC CGGCGGGTAC ATGCTCGACG TCGCGCTGCA GCGGGCGATC
GACCAGGGCT GGCACGAGGG CCCCCGCATC GTCCCCGCGG GCCACGCCGT CACGCCGTAC
GGCGGCCACC TCGACCCGAC GGTGTTCCAG CGCCTCGCAC CCGGGATCAT GCCGCTCAGC
GTGGCCGAGG GCATCGCCAA CGGCGTGGCC GAGGTGCGCG CCTGCGTCCG ATACCAGATC
CGCCACGGCG CCAAGGTGAT CAAGATATCG GCGTCGGGGG GCGTGATGTC GCACTCCACG
GCGCCGGGCT CGCAGCAGTA CTCCGACGAG GAGTTCGTCG CGATCGCCGA CGAGGCGCAC
CGGGCGGGAA TCCGGGTCGC CGCGCACGCG GTCGGCGACA GCTCGGTCCA GGCCTGCATC
CGGGCCGGTG TGGACTGCAT CGAGCACGGC TTCCTCGCCA CCGACGAGTC CATCCAGATG
ATGGTCGACC ACGGGACGTT TCTGGTGTCG ACCACATACC TCACCGAGGC GATGGCCATC
GAGCGGGCCG CGCCCGAACT CCAGAAGAAG GCCGCTGAGA TCTTTCCTCA GGCCAAGGCG
ATGCTGCCCA AGGCGATCGC CGCCGGAGTG AAGATCGCGT GCGGTACGGA CGCGCCGGCG
ATCCCGCACG GGGAGAACGC CATGGAGCTC ATCGCGCTGG TCGACCGTGG CATGACTCCG
ATGCAGGCGC TGCGGGCGGC GACCGCGACG AGCGCGGAGC TCATCCAACG GGAGGACGAG
CTCGGCCAGC TCGCCGTCGG GTATCTCGCC GACATCATCG CGGTGCCTGG TGATCCGTCC
GAGGACATCG CCGCCACACG GGACGTCCGT TTCGTTATGA AGGAGGGCCG TGTCTACAAA
CACCTCTGA
 
Protein sequence
MDIDAGVVRS PAVIVVEGGR ITAIGPTEVP ANATEFDLGD VTLLPGLMDM ELNLLIGGPG 
GPDGLPNPMH GVQDDPVYRT LRGAVNARAT LQAGFTTVRN LGLMVKTGGY MLDVALQRAI
DQGWHEGPRI VPAGHAVTPY GGHLDPTVFQ RLAPGIMPLS VAEGIANGVA EVRACVRYQI
RHGAKVIKIS ASGGVMSHST APGSQQYSDE EFVAIADEAH RAGIRVAAHA VGDSSVQACI
RAGVDCIEHG FLATDESIQM MVDHGTFLVS TTYLTEAMAI ERAAPELQKK AAEIFPQAKA
MLPKAIAAGV KIACGTDAPA IPHGENAMEL IALVDRGMTP MQALRAATAT SAELIQREDE
LGQLAVGYLA DIIAVPGDPS EDIAATRDVR FVMKEGRVYK HL
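
The gene sequence begins with GTG while the protein sequence begins with M. Under translation table 11 (the bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons and is read as methionine when it initiates translation. The following is a minimal Python sketch of that relationship, not the database's own pipeline: the function name `translate_cds` and the constant names are hypothetical, and only the first 30 bases and the last 9 bases of the gene sequence above are used as input.

```python
from itertools import product

# Codon table in NCBI order (bases T, C, A, G). Translation table 11 assigns
# the same amino acids as the standard code; it differs in its start codons.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Start codons listed for translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid start codon as M."""
    dna = "".join(dna.split()).upper()  # drop the 10-base spacing
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        amino = CODON_TABLE[codon]
        if amino == "*":  # stop codon ends translation
            break
        protein.append(amino)
    return "".join(protein)


# First 30 bases of the gene sequence, as printed in the record.
head = translate_cds("GTGGACATCG ACGCCGGTGT CGTGCGGTCG")
# Last 9 bases of the gene sequence (CAC CTC TGA).
tail = "".join(CODON_TABLE[c] for c in ("CAC", "CTC", "TGA"))

print(head)  # MDIDAGVVRS
print(tail)  # HL*
```

The first ten residues and the final two residues agree with the protein sequence listed above, and the trailing TGA is the stop codon.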