Gene Franean1_3361 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3361 
Symbol 
ID5671732 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3984977 
End bp3986377 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content74% 
IMG OID641242249 
Productbeta-lactamase domain-containing protein 
Protein accessionYP_001507669 
Protein GI158315161 
COG category[P] Inorganic ion transport and metabolism
[R] General function prediction only 
COG ID[COG0491] Zn-dependent hydrolases, including glyoxylases
[COG0607] Rhodanese-related sulfurtransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.655276 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones19 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGATTG CCGTTGAGGT CGTGGAGACC TCCACGCTCG GCGACCGCAG CTACCTGGCG 
CACGACGGCC AGCTGGCGGT GGTGGTCGAC CCGCAGCGTG ACATCGACCG GATCCTGGCC
CTGGCCGGCG GGATCGGCGT GCGGATCAGT CACGTGGTGG AGACCCATCT GCACAACGAC
TACGTCTCGG GCGGGCTCGC GCTGGCCCGG GTGACCGGCG CCCGCTACGG GGTGGCGGCG
GCCGACGACG TCCACTTCGA CCGTCTGCCG TTGTCCGACG GCGACGAGAT CACGATCTCC
GACGAGGCCG CGATGACGGT GGTGGCCACC CCCGGCCACA CCTTTCATCA CCTGTCGTTC
CTGCTCCACG GCCCGAAGGG CCCGGCCGGG GTGTTCACTG GCGGGTCGCT GCTGTTTGGT
ACCACCGGCC GCACCGACCT GCTGGGCGCC CAGCACGCGC ACACCCTTGC CCGGCACCAG
CACGCGTCGG CGCGGCGGCT GGCCGACCTG CTGCCGCAGG GGGCGCAGGT CTGGCCGACG
CACGGCTTCG GCAGCTTCTG CTCGGCCAGC CAGTCCGACG TTCCCGCGTC GACCATCGGA
CAGGAACGGA CGATCAATCC GGTGCTGCGG CTGGCGGCGG ACGACTTCGT CACCGAGGTC
CTCGCCGGCC TGGACGCGTT CCCCGCCTAC TACGCCCACA TGGGCGCGCG GAACGCCGGC
GGGGCCGACC TGGTGGATCT CACCCCGGTC AGGGTCGCCG ACGCGGGCGA GCTGCGCTCC
CGGATCACGG CCGGCGAGTG GGTGGTCGAC CTGCGGTCCC GGAAGGCCTT TGCGCACCGG
CACCTGACCG GGACGCTGAG CCTCGGCCTG GACGGTCCGA TGTCGACCTG GCTCGGCTGG
CTGATGAGCT GGGGGATGCC GGTGACGCTG CTCGGCGAGT CCGACGCCCA GATCGGCCAG
GCACAGCGGG AGCTGGCCCG CATCGGCATC GACCGCCCGG CGGCCGCCGC GACCGGCACC
CCCGAGCAGT GGGCCGCGGG CGACCAGTCG CGCCTGGGCC CGCTGCCTTC GGCGACGTAC
CCGGAGCTGG CCGCGGCGCT GGCCGGACGT CCGCCGCGTG GCCTGCCCGC TCCCGATGTC
GTGCTGGACG TCCGGCTGCG CAACGAGTGG CGGAACGGAC ACCTTGCCGG GGCCGTGCAC
ATCCCGCTGC CCGAGCTGCC CGGCCGGCTC GGGGAGGTGC CGGGCGAAGC CGTGTGGGTG
TACTGCGGGT CCGGGTACCG CGCCGCTGCC GCCGCCAGCC TCCTTGCCCG CGCCGGCCGC
ACCGCGGCGT TCATCGACGA CTCCTACACC GCTGCCGCCG ACGCCGGCCT GCCGATCGTC
TCCGGGGAGG GAGCGGAATG A
 
Protein sequence
MTIAVEVVET STLGDRSYLA HDGQLAVVVD PQRDIDRILA LAGGIGVRIS HVVETHLHND 
YVSGGLALAR VTGARYGVAA ADDVHFDRLP LSDGDEITIS DEAAMTVVAT PGHTFHHLSF
LLHGPKGPAG VFTGGSLLFG TTGRTDLLGA QHAHTLARHQ HASARRLADL LPQGAQVWPT
HGFGSFCSAS QSDVPASTIG QERTINPVLR LAADDFVTEV LAGLDAFPAY YAHMGARNAG
GADLVDLTPV RVADAGELRS RITAGEWVVD LRSRKAFAHR HLTGTLSLGL DGPMSTWLGW
LMSWGMPVTL LGESDAQIGQ AQRELARIGI DRPAAAATGT PEQWAAGDQS RLGPLPSATY
PELAAALAGR PPRGLPAPDV VLDVRLRNEW RNGHLAGAVH IPLPELPGRL GEVPGEAVWV
YCGSGYRAAA AASLLARAGR TAAFIDDSYT AAADAGLPIV SGEGAE