Gene Franean1_5574 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5574 
Symbol 
ID5673902 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6754895 
End bp6756040 
Gene Length1146 bp 
Protein Length381 aa 
Translation table11 
GC content74% 
IMG OID641244428 
Productacetamidase/formamidase 
Protein accessionYP_001509832 
Protein GI158317324 
COG category[C] Energy production and conversion 
COG ID[COG2421] Predicted acetamidase/formamidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.49831 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCTGCG ACGGCCGCCG TCGGCACGGA CGGTCCCGGG GCGGGCGGAA CAACGAGGAG 
TGCGACATGT CCGTCCTGCA GTGCGGGTCC GGCGAGGTAC CGGGGGAGCA CTACCTGCCC
TCGTCGCCCA AGACGGTCAC CTGGGGCCGG CTGCCCAGCG CGGCGACCGA CCCGGTGCTC
GAAGTGGCGG CGGGCGCCAC CGTCACCATC GACACCGTCT CGCACGAGGG CCTCATGGAG
GACCAGGGCC GCGACCCGGT CGCCTTCTGG GCGGAGCACG CCGTCGCGGC GGATTCGGTC
CTCCTCGACG CGATCGACAT CGCCCGGGAC GTCTCCCACT ACCGCGGGCT CGACGGCCCG
CACGTGGTCA CCGGTCCGGT GCGGGTCGAC GGCGCGCGCC CCGGCGACAT CCTCAAGGTC
GAGTTCCTCG AGCTGCGCCC GCGGGTGCCC TACGGGCTCG TCTCCAGCCG GCACGGCCGG
GGCGCGCTGC CCGGCGAGCT TCCCGTCGGC CCCGACGGCG TGCTCGCCGA CCGTTACAGC
CAGTTCTGCG AGGTGGACGT CGCCGCCGGG CGGGCCGTGA TGCGCTACGG CGAGGGCCGG
CAGATCAGCT TCCCGGTCGC GCCGTTCATG GGTCTCACCG GCCTCACCCC GGCCGGGGAG
AAGGCCCTCA ACACGACCCC GCCCGGGGCA TTCGGCGGGA ACCTCGACGT GCGCGACCTC
GTCGCCGGGT CGACGCTCTA CCTGCCGGTC CAGATCCCGG GCGCGGGCTT CTACACCGGT
GATCCGCATT TCGCCCAGGG GCACGGCGAG GTGTCGCTGA CCGCCCTGGA GGCCTCGCTG
CGCACGACCG TCCGGCTCAC CCCGCTTCCG GCCGCCGCGG CGCTGCCCTT CGGCGCGGGT
TCCGGCGGCC CCTTCGGCGA GACCCCGGAG CACTGGATCG CCATCGGCCT GCACAACGAC
CTCGACGAGG CCATGCGGCT GGCCGTCCGC GAGGCGCTGC GGGTGCTGCG CCAGGTCCGG
GATGTCCCGG TGATGGTCGC GTACAGCTAC CTGTCGGCCG CCGCCGACTT CGTGGTCAGC
CAGGTCGTCG ACGACGTGAA GGGCGTGCAC TGCCTGATCC GCAAGCGCGA CTTCCCGGCC
TGGTAG
 
Protein sequence
MSCDGRRRHG RSRGGRNNEE CDMSVLQCGS GEVPGEHYLP SSPKTVTWGR LPSAATDPVL 
EVAAGATVTI DTVSHEGLME DQGRDPVAFW AEHAVAADSV LLDAIDIARD VSHYRGLDGP
HVVTGPVRVD GARPGDILKV EFLELRPRVP YGLVSSRHGR GALPGELPVG PDGVLADRYS
QFCEVDVAAG RAVMRYGEGR QISFPVAPFM GLTGLTPAGE KALNTTPPGA FGGNLDVRDL
VAGSTLYLPV QIPGAGFYTG DPHFAQGHGE VSLTALEASL RTTVRLTPLP AAAALPFGAG
SGGPFGETPE HWIAIGLHND LDEAMRLAVR EALRVLRQVR DVPVMVAYSY LSAAADFVVS
QVVDDVKGVH CLIRKRDFPA W