Gene Francci3_2242 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2242 
Symbol 
ID3905010 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2614652 
End bp2616883 
Gene Length2232 bp 
Protein Length743 aa 
Translation table11 
GC content66% 
IMG OID637879573 
Producthydantoinase B/oxoprolinase 
Protein accessionYP_481339 
Protein GI86740939 
COG category[E] Amino acid transport and metabolism
[Q] Secondary metabolites biosynthesis, transport and catabolism 
COG ID[COG0146] N-methylhydantoinase B/acetone carboxylase, alpha subunit 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0203545 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCCAGC CCGACCACGC TGACGGCAAG TCAGTCGTCG AGAAGTTCCT CGAAGAAAAC 
GTCCTGTTCC TCGGGCCCGA CCCCGAGATC ATGCAAAGCC ACCATCTCGC CCCGGAGTCG
CCCCGCGAAA CGGAGGCGCT CGCGCGGTTC ACCGATCCCG AGCAGATCAA CCTGGTGCGG
CACAAGCTGC AGACCGCGTG TAACGAGTCC TTCGACATGG TCGAGCAGAT GGGCGCGGCG
CCCGGCGCGA AGTGGGGCGA CCTGATCTCC GGGGTGTGGA CGGCCTCGGG TGACCTCGCC
CTGTCCAGCA TGGGTGGTGT GCTGCTGTTC TCGGTGCTCA CGCAGCACCC GGTCAAGTTC
ATTGTCAAGT ACTGGGTGGA TGAGCCGACG GTCGGGGTCC GTGAGGGCGA TGTCTTCATG
CACAACGACG CCCGCTACGG CAACGTCCAC AACACCGACC AGAGCATCCT CATCCCGGTC
TTCCACGAGG GGCAGCTGAT CTGCTTCGCC GGCGCGGTCT GCCACGAGGG GGAGAACGGC
GCCACCGAAC CCGGCGGTAT GCCCTCGGCG GCGGAGTCCC CGTTCGACGA GGGATTGAAG
ATTTCGCCGA TCAAGGTCGG TGAGAACTAC ACGTTCCGCC GTGACCTCAT GACCTTCCTG
CAGAACTCGG TGCGCGAGCC GAAGCTGCAG CTGGAGGGCA TGAAGGCCAA GCTGTACGCG
GCGATGCGGA CAAGGGACCG GATCCACGAC ACCATCGCCG AGTACGGTGT CGACGCGGTC
GTCGCCACGT TGCGGCGCAC CCTGACCGAC ACCGCCGACG AGGTCCGCCG CCGGCTGAGG
TCGTGGCCGG AGGGCACCGT GCGGCAGAAC GTCTTCGCGG ACGGGACGCT GCGGGAGAAC
TGCCTGGTCA AGATCAGACT GGCGATGACC AAGAAGGACG ACGAGCTGAT CCTGGATTTC
CGGGGCTCGT CGCCGGAATT CCTGAACCGG GCCAACAACA CGATCCTCTC GTCGATGAAG
GGCATGCTGG CCCAGGAGTT CCTGACCTTC GTGTGGCCGG ACCTGCCGCG AAACCAGGCG
GTGTTCGAGC CGATGACTGT TCTCACCGAC CCGCGGTCGG CGCTGAACTG TTCGCCGGAG
GCGCCGAACG CGCAGAGCAT GATGACGTTC TTCCCCTCCT TCACCGCGGC CCAGCTGGCG
ACGCCGAAGC TCCTCTACAG CGCGGGTGAA CGCTCTACGG ACGTCATCGC CGGCTGGTTC
AACATGATCG TCACGTTCAT CTACGGTGGT GTCACCCAGC ACGGCGAGCT GGTGGGCAAC
GTGTGCGCCG ACCTCAACGG CATGGGCGGG GCCGCGCGGT CCAACCGGGA CGGCGAGCAC
GCGGTCGCCC CGATCTTCGC GCCCATGGCC GACATCGGGG AGCAGGAACT CATCGAGGAA
GAAGTCCCGA TTCTCAAGAT CGTGCCGAAC AGGGTAATGC GGGACAACCA GGGCTTCGGC
AAGTTCCGCG GCGGCCAGGG CTACCAGCAG ATCGCCACCG TCAAGGACAG CGCCATGTGG
GGCTTCATGG CGTGCAGCAT CGGCTCGAAG TTCCCCAGCT CGCACGGCAT CTTCGGCGGC
TACGGCCCCG GCACCTACCC GCTGTGCAAG ATCAAGAATG TCGACATCTT CAAGGTGATG
GACAACCAGC GCGAACTGCT GCGCTACACC GTCGAGGAAC TGATGAACGA GCGTCCGTTC
CCGGACGCGA CCTACTCGAC GCATCACATG GGCATGCAGT TCGAGCTCGC CGAGCGCGGC
GAGCTGTACA TGCTCACCCA GGGCACCGGC GGTGGCTACG GGGACGTGCT CGAACGCGAT
CCCGAGCTGG TGGCTTCCGA CTACCGCGAC GGCCTGGTGT CCATGGACAC CGTCCGCGAC
ATCTACCACG TGGTGCTCAA CCCCGACACC GCCGTGCTGG ACGCCGAGGG CACCACGGCG
GCCCGGGAGG CCGAGCGGGC GGCTCGGCTG CGGCGGGGCA AGCCGTATGC GGAGTTCGTC
AAGGAATGGG AGACCGAGAC GCCTCCGGCC GACGTGCCGT TCTTCGGCTC CTGGGGCGAC
CCGCGGGTGC TGTTCCGCGG CACTCCGCAG GACACCTGCC CCGCGGACGC GATCGTGCCG
GTGATGATGC CCGACCCGAA GGACGTGCGG ATCGCGCAGC TGGAGGCCAA GCTGGCGGAG
CTGCAGGACT GA
 
Protein sequence
MSQPDHADGK SVVEKFLEEN VLFLGPDPEI MQSHHLAPES PRETEALARF TDPEQINLVR 
HKLQTACNES FDMVEQMGAA PGAKWGDLIS GVWTASGDLA LSSMGGVLLF SVLTQHPVKF
IVKYWVDEPT VGVREGDVFM HNDARYGNVH NTDQSILIPV FHEGQLICFA GAVCHEGENG
ATEPGGMPSA AESPFDEGLK ISPIKVGENY TFRRDLMTFL QNSVREPKLQ LEGMKAKLYA
AMRTRDRIHD TIAEYGVDAV VATLRRTLTD TADEVRRRLR SWPEGTVRQN VFADGTLREN
CLVKIRLAMT KKDDELILDF RGSSPEFLNR ANNTILSSMK GMLAQEFLTF VWPDLPRNQA
VFEPMTVLTD PRSALNCSPE APNAQSMMTF FPSFTAAQLA TPKLLYSAGE RSTDVIAGWF
NMIVTFIYGG VTQHGELVGN VCADLNGMGG AARSNRDGEH AVAPIFAPMA DIGEQELIEE
EVPILKIVPN RVMRDNQGFG KFRGGQGYQQ IATVKDSAMW GFMACSIGSK FPSSHGIFGG
YGPGTYPLCK IKNVDIFKVM DNQRELLRYT VEELMNERPF PDATYSTHHM GMQFELAERG
ELYMLTQGTG GGYGDVLERD PELVASDYRD GLVSMDTVRD IYHVVLNPDT AVLDAEGTTA
AREAERAARL RRGKPYAEFV KEWETETPPA DVPFFGSWGD PRVLFRGTPQ DTCPADAIVP
VMMPDPKDVR IAQLEAKLAE LQD