Gene Franean1_2978 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2978 
Symbol 
ID5671362 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3503280 
End bp3504569 
Gene Length1290 bp 
Protein Length429 aa 
Translation table11 
GC content71% 
IMG OID641241882 
Productamidohydrolase 
Protein accessionYP_001507302 
Protein GI158314794 
COG category[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG1228] Imidazolonepropionase and related amidohydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones29 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGCGC CGATCGTCCT GCGCGCCGCA CGCTGGCTGG ACATCGACGC CGGAGAGGTC 
CGCGCGCCCG CGGAGATCGT CGTAGAGGGC GACCGCATCA CCGCGGTGAA CCCCGCCACG
CCGCCTGCGG GCAGCGTGGA ACTCGACCTG GGCGACGTCA CGTTGCTCCC CGGGTTGATG
GACATGGAGC TCAACTTCCT CATCGGCGGG CCCGAGACCC CGACGGGACT GCCGCTGCCC
ATGCACGGCG TGCAGGACGA CCCGGCGTAC CGCACCATCC GAGGAACCAT CAACGCCCGT
ACGACGCTGC ACGCTGGCTT CACCACGGTG CGCAACCTCG GTCTGATGGT CAAGACCGGG
GGGTACCTGC TCGACGTAGC ACTCCAGCGC GCCATCGACC AGGGATGGGC CGAGGGCCCG
CGGATCATCG CGGCCGGGCA CGCGGTGACC CCGTACGGCG GGCACCTCGA CCCGACCGTG
TTCCAGCGGC TGGCGCCCGG CGTCATGCCC CTGTCGATCG GCGAGGGGAT CGCCAACGGT
GTGGGCCAGG TGCGGGAATG CGTCCGCTAC CAGATCCGCC ACGGTGCCAG GGTCATCAAG
GTGTCGGCCT CCGGCGGGGT GATGTCGCAC AGCACCGGGC CGGGCGCCCA GCAGTACTCC
GACGAGGAGC TGGCCGCGAT CGCGGACGAG GCGCACCGGG CCGACATCCG GGTCGCCGCG
CACGCGGTGG GCGACCGGGC GATCCGGGCC TGCGTGCGTG CCGGGATCGA CTGCATCGAG
CACGGCTTCC TCGCCAGCGA CGACACCCTC AAGCTGATGG CCGACCACGG CACGTTCCTG
GTGTCGACGA CCTACCTGAC CGACGCGATG GACATCGCCC GGGCCGCACC CGAGCTGCGC
AAGAAGGCGG CAGTGGTGTT CCCCCAGGCC AGGGCGATGC TCCCGAAGGC GATCGCGGCC
GGGGTGCGGA TCGCCTGCGG CACCGACGCG CCCGCCGTGC CACACGGTCA CAACGCCAAG
GAACTGATCG CACTGGTGTC GCGGGGCATG ACTCCCGTCC AGGCCCTGCG GGCCGCGACC
GTCACCAGCG CGGAGCTCGT CGAACTGGAC CACGAGCTCG GACGGTTGAA GGCCGGCTAC
CTCGCCGACA TCATCGCCGT CCCCGGCGAC CCCTCCCAGG ACATCACCCG CACCGAGGAC
GTCCGCTTCG TCATGAAGGA CGGCCTCGTC CACCGCGACG ACCGATCGTC ACCGACCCGA
ACGGAGCACA CATGGCAAGC GGCGTCCTGA
 
Protein sequence
MTAPIVLRAA RWLDIDAGEV RAPAEIVVEG DRITAVNPAT PPAGSVELDL GDVTLLPGLM 
DMELNFLIGG PETPTGLPLP MHGVQDDPAY RTIRGTINAR TTLHAGFTTV RNLGLMVKTG
GYLLDVALQR AIDQGWAEGP RIIAAGHAVT PYGGHLDPTV FQRLAPGVMP LSIGEGIANG
VGQVRECVRY QIRHGARVIK VSASGGVMSH STGPGAQQYS DEELAAIADE AHRADIRVAA
HAVGDRAIRA CVRAGIDCIE HGFLASDDTL KLMADHGTFL VSTTYLTDAM DIARAAPELR
KKAAVVFPQA RAMLPKAIAA GVRIACGTDA PAVPHGHNAK ELIALVSRGM TPVQALRAAT
VTSAELVELD HELGRLKAGY LADIIAVPGD PSQDITRTED VRFVMKDGLV HRDDRSSPTR
TEHTWQAAS