Gene Francci3_2237 details

Gene Information

Locus tag: Francci3_2237
Symbol:
ID: 3905005
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2608128
End bp: 2610044
Gene Length: 1917 bp
Protein Length: 638 aa
Translation table: 11
GC content: 68%
IMG OID: 637879568
Product: hydantoinase/oxoprolinase
Protein accession: YP_481334
Protein GI: 86740934
COG category: [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit
TIGRFAM ID:
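
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(2608128, 2610044)
print(length_bp)                  # 1917
print(protein_length(length_bp))  # 638
```

Both values agree with the Gene Length (1917 bp) and Protein Length (638 aa) fields in the record.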


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.570918
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.102983
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGAATCC TGGTCAATAT CGACAACGGT GGCACGTTCA CCGATGTGTG CGTGACGGAC 
GGTGAGCGCA TCGTGCATGC GAAGACGCCG ACGACCCCGC ACGATTTAAC GCAGTGTTTC
GTCGACGGGC TGCGGACAGC GTCCGACCGG CTGTATGGCG AGGAGGACAC CGCTCGTCTG
TTGCGCGAGA CGGAGTACCT GCGCTATTCG ACGACGTCGG GTACGAACGC CGTGGTCGAG
CGGAAGGGCG CACCGGTCGC CCTGCTGGTC GACAGCGGTG CGGAGGAGGA CGTCTACGGC
ATTGCGAACC TGGTCGACGC CTCGCTGTGG CAGGCGCTGG TGCCGCATTC TCCGGTCGGG
ATCACGGTGG GCGCCGACGG GTCGGTGGGC TTGGCGGAGT TCACGACGGC AATCAACGAA
CTGTTGGCGA CGAGTACCTC GCGGATCGTG ATCGCGCTGC GGAGCGCGGC GGCGGAGCGG
GCGATCAAGA ATCTGTTGTT GGAGCGGTAC CCGCGGCATC TGCTGGGCGC GGTTCCGTTC
ACCCTGTCGC ATGAGCTTGT GCACGACGTC GACGACGCGC GGCGGGTGCT GACCGCGGTG
CTGAACTCCT ACCTGCACCC GGGGATGGAG CACTTCCTCT ACGGCGCGGA GAAGGCGTGC
CGGGAGAACG GGCTGCCTCG CCCGCTGCTG ATCTTCCGGA ATGACGGGGA CTCGGCCCGG
GTGGCCAAGA CCACGGCGTT GAAGACGTGG GGTTCCGGGC CGCGGGGTGG GCTGGAGGGC
AGCGTCGCCT ACGCCTCGCT CTACGGCGCG GACGTGCTGG TCGGGGTGGA CGTGGGTGGC
ACCACGACCG ACGTGTCGGT CGTGGTGGAC AAGGCCCTGA CGGTGCACGC GCACGGCCGG
GTCGATTCGG CGCAGACCTC GCTGCCGATT CCGGACCTGA GCAGTATCGG GCTGGGTGGC
AGTTCGGTGG TCCAGGTCGT CAACGGTCAG ATTCAGATCG GTCCGCGCAG TGTGGGTGCC
GCGCCCGGCC CGGCGTCTTT CGGCCGTGGT GGCACCGATG CGACGGTGAC CGATGCCCTG
CTGCTTGCCG GGGTGTTGGA CCCGGACAAC TATCTGGGTG GGGATCTGAA GTTGGACCCG
GCCCGGGCGG AGCGGGCGTT GCTGACCCAT GTCGGGGAGC CCCTGTCGTT GTCGGCGCAG
GCCGCGGCGC TCGCGGTGTT GCGGGTGTTC GAGGAGCAGG CGGGGGCCGC GGTCAAGGAG
ATGATCTCCG CGGCGGGTCG TGAGCCGGGT GAGGCCACGC TGCTGGCCTT CGGTGGCGCC
GGTCCGGTGC TTGCCTCGGG GATCGCGCGG GCGGCGGGGA TCGCACGGGT GATCGTGCCG
CATCTGTCGG CGGTGTTCAG CGCGTTCGGT ATCGGTTTCA GCGGGTTGGC ACACGAATAC
AGCGTGCCGA TGCCGGGCGT CGACGTCGAG GTGAAGGCGG CCCGCGACGA TCTGTTGACC
CGCGCGCGGC GTGACATGTT CGGTGAAGGC GTGTCCATCG ACGAATGCAC TGTCGAGACC
CGCGGCCGAT TCATCGTCGA CGGCGTGTTG CGGGACGAGA CGTGGACCGA CGGGTCGTCT
CCCGGGCAGG CCGATCAGCT GGTGGTGCGG GCCTGGTATC CGCTGCCGAC CTTCGAGCTG
GTGGCCGACG AACACGGCAC GGTCCAGCCG GCGGCCGCGG ACGGATCGCG TCGTATCCAT
TTCGCTGACG GTAACGAGCA GGAGATTGCG GTCTACCGTC CGGAGAATCT GGAGCCGGGG
CAGGGGGCCG CGGGGCCGGC GCTGGTCGCG GGTGACTATC TGACCTGTCT GATCGAGCCC
GGGTGGGGGT TCCGGGTGAG CAGCAACTCT GACCTGATCC TGGAGGCCCA GCAGTGA
 
Protein sequence
MGILVNIDNG GTFTDVCVTD GERIVHAKTP TTPHDLTQCF VDGLRTASDR LYGEEDTARL 
LRETEYLRYS TTSGTNAVVE RKGAPVALLV DSGAEEDVYG IANLVDASLW QALVPHSPVG
ITVGADGSVG LAEFTTAINE LLATSTSRIV IALRSAAAER AIKNLLLERY PRHLLGAVPF
TLSHELVHDV DDARRVLTAV LNSYLHPGME HFLYGAEKAC RENGLPRPLL IFRNDGDSAR
VAKTTALKTW GSGPRGGLEG SVAYASLYGA DVLVGVDVGG TTTDVSVVVD KALTVHAHGR
VDSAQTSLPI PDLSSIGLGG SSVVQVVNGQ IQIGPRSVGA APGPASFGRG GTDATVTDAL
LLAGVLDPDN YLGGDLKLDP ARAERALLTH VGEPLSLSAQ AAALAVLRVF EEQAGAAVKE
MISAAGREPG EATLLAFGGA GPVLASGIAR AAGIARVIVP HLSAVFSAFG IGFSGLAHEY
SVPMPGVDVE VKAARDDLLT RARRDMFGEG VSIDECTVET RGRFIVDGVL RDETWTDGSS
PGQADQLVVR AWYPLPTFEL VADEHGTVQP AAADGSRRIH FADGNEQEIA VYRPENLEPG
QGAAGPALVA GDYLTCLIEP GWGFRVSSNS DLILEAQQ
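
The relationship between the gene sequence and the protein sequence can be sketched as follows. This is a minimal illustration, not the tool the database uses. It relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons; because this gene starts with ATG, no special handling of alternative start codons is included. The helper names (clean, gc_content, translate) are hypothetical.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (standard code; table 11 uses
# the same codon-to-amino-acid assignments).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGGGAATCC TGGTCAATAT CGACAACGGT"))  # MGILVNIDNG
```

Passing the full gene sequence to translate should reproduce the protein sequence above, and gc_content applied to the same input gives the value that the GC content field rounds to a percentage.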