Gene Francci3_2929 details

Gene Information

Locus tag: Francci3_2929
Symbol:
ID: 3903993
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 3450326
End bp: 3451987
Gene Length: 1662 bp
Protein Length: 553 aa
Translation table: 11
GC content: 75%
IMG OID: 637880250
Product: 2-nitropropane dioxygenase, NPD
Protein accession: YP_482016
Protein GI: 86741616
COG category: [R] General function prediction only
COG ID: [COG2070] Dioxygenases related to 2-nitropropane dioxygenase
TIGRFAM ID: [TIGR02814] PfaD family protein
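
The Start bp, End bp, Gene Length and Protein Length fields are related by simple arithmetic. The sketch below checks that relationship. It assumes 1-based inclusive coordinates and a single terminal stop codon; the record does not state either convention, but its numbers are consistent with both. The function names are hypothetical and only for illustration.

```python
def gene_length(start_bp, end_bp):
    """Length of a feature given inclusive start and end coordinates (assumed 1-based)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Number of amino acids encoded, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(3450326, 3451987)
print(length)                  # 1662
print(protein_length(length))  # 553
```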


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0103438
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.46056
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTGACGA CGACAACGAC GGTCCGTGGC GCGCCGGCGA CGCCGCGCCC TGGATCGGGC 
GGGCCGCGGC TCGCGCGATC CGCCCAGGAG ATCCACGACC TGCTCGCCCG GCTGGACGCG
CCCTGCGTCG TGGTGAGCGA ACAGGGCGGC GCGATCGCGG CCACCGACGA CCCGGCGAGC
CTGCGCGCGG CGGGGGCGAC GGTGCTGGCT GTGGCGCCGC CGGCCCGCCC CGAGCGGCTG
GGAGCCGCGT CCTTCCTGAC CGACTACGGG GTGCGTCAGC CCTACATGAC CGGCGCGATG
GCGAACGGCA TCGCTTCGCC GGAGCTGGTC GTGGCGATGG CACGGGCGGG CTTCCTCGCC
ACGTACGGGG CGGCCGGCGT GCTGCCCGAC CGGATCGACG ACGCGCTTGG GCGGATCCGC
CGTGAGCTCG GCCCGGCGCC CTTCGCCTGC AATCTGATCC ACAGCCCGAA CGAGCTGGAG
CTGGAACGGG CCATCCTGGC CGCCTGCCTG CGCCACGGGG TGACCTGCGT GGAGGCGTCC
GCGTTCCTGG AGCTGACCCC GCAGATCGTC GCCTACCGGG CCGCGGGGCT GCGGCCGGGC
GGCGCTGGCG GCGTACACGT CGGGCACCGG GTGGTGGCCA AGGTCTCCCG CGGGGAGGTG
GCCGAGCTCT TCCTGCGCCC CGCCCCGGAC GCGCTGCTGC GCCCGCTGGT GGCGGATGGC
ACCCTGACCG CCGAGCAGGC CGCGCTCGCC CGCACGGTGC CGATGGCCGA CGACATCACC
GTCGAGGCGG ACTCCGGCGG CCACACCGAC CGCCGGCCGC TCCCGGTCCT GCTCCCCGAG
ATCATCGCGG TGCGCGACCG GATCGCCGCC GAGCTCGGCT ACCGCCGCCC GCCGAGAGTG
GGAGCCGCGG GTGGCATCGG TACGCCATCG GCGGTGTTCG CCGCGTTCGC GCTCGGCGCA
GCCTACGTCG TCACCGGTTC GGTGAACCAG GCGTGCGTCG AGTCCGGTCA GTCGGCGGCG
GCGCGGGCGC TGCTGGCGAA GGCGGGCCCG AACGACATCG ACATGGCGCC GGCCTCCGAC
ATGTTCGAGA TCGGCGCCGA GGTGCAGGTC CTGCGCCGCG GCACGATGTT CGCCGGGCGG
GGCCGCCGGC TGTACGACCT CTACCGCGCC CACGACTCCC TCGACGACCT TTCGGCGGAG
GATCGGAACT GGCTGGAGCG TTCGGTCCTG CGCCGGTCCG TGGACGAGGT ATGGGCCGAC
ACCGTCGACT ACTTCAGCCG GCGCGACCCG GAGCAGATCG AACGCGCGCA GGCCAACCCG
AAGAGACGGA TGGCGCTGGT GTTCCGCTGG TATCTCGGGC TGTCCTCGGG CTGGGCGATC
TCCGCCGCGC CCGACCGGAT CACCGACTAC CAGATCTGGT GCGGCCCGTC CCTGGGCGCC
TTCAACACCT GGGCGGCCGG CAGCTACCTG GCGGACGTCG ACCGGCGCAG CGCGGTGGAC
GTCGCGGGTG AGCTGATGCT CGGCGCCGCC TACACCGGGC GGGCCGCGGC GCTGCGGTTC
GCCGGGGTGC GGCTGCCGGC GCGGGCGGCC GCCTACCGAC CGCCGGCCAC GCGCGAGAGC
TCGCCGGCGC ACCGGTACGT CCTGACGGCG GGTGCCCGGT GA
 
Protein sequence
MVTTTTTVRG APATPRPGSG GPRLARSAQE IHDLLARLDA PCVVVSEQGG AIAATDDPAS 
LRAAGATVLA VAPPARPERL GAASFLTDYG VRQPYMTGAM ANGIASPELV VAMARAGFLA
TYGAAGVLPD RIDDALGRIR RELGPAPFAC NLIHSPNELE LERAILAACL RHGVTCVEAS
AFLELTPQIV AYRAAGLRPG GAGGVHVGHR VVAKVSRGEV AELFLRPAPD ALLRPLVADG
TLTAEQAALA RTVPMADDIT VEADSGGHTD RRPLPVLLPE IIAVRDRIAA ELGYRRPPRV
GAAGGIGTPS AVFAAFALGA AYVVTGSVNQ ACVESGQSAA ARALLAKAGP NDIDMAPASD
MFEIGAEVQV LRRGTMFAGR GRRLYDLYRA HDSLDDLSAE DRNWLERSVL RRSVDEVWAD
TVDYFSRRDP EQIERAQANP KRRMALVFRW YLGLSSGWAI SAAPDRITDY QIWCGPSLGA
FNTWAAGSYL ADVDRRSAVD VAGELMLGAA YTGRAAALRF AGVRLPARAA AYRPPATRES
SPAHRYVLTA GAR
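
Both sequences above are displayed in space-separated blocks of 10 characters, so they need to be ungrouped before any downstream use. The following is a minimal sketch of that cleanup, plus a GC-fraction calculation of the kind that could be compared against the GC content field. The helper names `ungroup` and `gc_fraction` are hypothetical, and the excerpt covers only the first two displayed lines of the gene sequence, so its GC fraction is not the whole-gene value.

```python
def ungroup(seq_text):
    """Remove the spaces and line breaks used for display grouping."""
    return "".join(seq_text.split()).upper()


def gc_fraction(dna):
    """Fraction of G and C bases in an ungrouped DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First two lines of the gene sequence, as displayed in the record.
excerpt = """ATGGTGACGA CGACAACGAC GGTCCGTGGC GCGCCGGCGA CGCCGCGCCC TGGATCGGGC
GGGCCGCGGC TCGCGCGATC CGCCCAGGAG ATCCACGACC TGCTCGCCCG GCTGGACGCG"""

dna = ungroup(excerpt)
print(len(dna))                    # 120
print(dna[:3])                     # ATG
print(round(gc_fraction(dna), 2))  # GC fraction of the excerpt only
```

Applied to the full Gene sequence and Protein sequence blocks, `ungroup` should yield strings whose lengths match the Gene Length (1662 bp) and Protein Length (553 aa) fields.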