Gene Francci3_4132 details

Gene Information

Locus tag: Francci3_4132
Symbol:
ID: 3907097
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4932998
End bp: 4934248
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 68%
IMG OID: 637881460
Product: XRE family transcriptional regulator
Protein accession: YP_483209
Protein GI: 86742809
COG category:
COG ID:
TIGRFAM ID:
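
The start, end, gene length and protein length fields are related by simple arithmetic, so they can be cross-checked. The sketch below is an illustration, not part of the record: it assumes the coordinates are 1-based and inclusive and that the stop codon is counted in the gene length but not in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 4932998
END_BP = 4934248


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1251, matches "Gene Length"
print(protein_length(length_bp))  # 416, matches "Protein Length"
```

Under these assumptions both derived values agree with the record.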


Plasmid Coverage information

Num covering plasmid clones: 21
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATCG TTGGCGGGAC GGGTTCGGAT GACGCGCTGT CCGTCGGTCG ACGGCTGAAG 
GTTCTGCGCA CGCGGCGGGG GATGACGCGG GAGGTTCTGG GCGGGCTGGT CGGTCGCTCG
GCGTCGTGGG TGAAGGCGGT GGAGACGGGT CGGTTGGCTG CTCCGAAGTT GTCGATGTTG
CTTCGGTTGG CGGAAGCACT CAGGGTGCGT GACCTTGCGG AACTGACGGG TGGTCAGTCG
ATCCCGGTGG TGTTGTTCTC CGGCCCAGGG CATGATCGGC TCACTGCTGT GCGGGCTGCT
GTCAACAGGT TGCCGGTCTC AGCCGCCGAT CAGCCTGCGC CCTCGGTTGC AGATCTACGG
GGGCGGGTGA CCTGGGCCTG GAGGGCGAGG CATGCGGCGC CGAATCATCG GGAGGTTCTC
GGGGGGCTGT TGCCCGGGCT TCTGGATGAC GCGCAACGCA CTGCCCGCGC GGAGGCGGAC
GGTCCGCAGC GCCGTGCGGC GCTGGCCGTG TTGGCGGAGG TGTACGCGCT GACGCAGTTT
TTCGTGTCCT ACCAGCCCGC ACAGGATCTA GTCTGGCGGG TGGCGGAACG TGGTGTCTCG
ACCGCGCTGG ACTCGGACGA CCTGCATGCG GTCGGGGTGG CAGCCTGGCT GATGACACAG
GCGCATCGTG AGGCCGGGGA CTGGGATGCA GCGGACGTCG TGGCCAGTCA GGCGACGGCG
CTACTGCGGG ACAGCCTGTC CAGCGATGAC GCCACTGACG ACGTGGCGGC GCTGTGGGGA
GCGTTGCAGT TCGAGAGCGG CTACACGGCG GCGCGGCGTG GCGAGATCGG GAACGCTTGG
AGGTACTGGG ACGCGGCGGA CGCCGTCGCG CGGCGACTGC CGGACGACTA CTTTCATCCT
GTCACGTCGT TTTCGCAGAC GGTCATGCAC GCGCACGCCG TGACGGTTGC GGTCGAGCTG
CGACAGAGTG GTGAGGGCGT ACGACAGGCG GAACGGTGGC GCGCGGCGGT GATCCCGTCC
CATCCGCGGC AGGCGCGGCA TTGGATCGAG CAAGCACGGG CTTACCAGAT CGACAGGAAG
TACGACGAAG CGCTTCGTCT CCTCGATCAC GCCTACGACT CGGCGCCGGA GACGATCCGG
TACAACGGCC ACGCGCGGCG GATCATTCTG GAAGAGCTGG ACGCGCGGGA TGGACGGCGT
CGGGAGCAGG CAAGCGAGCT GGCCCGGAAG GTGGGTCTGT TGGGGGTATA G
 
Protein sequence
MAIVGGTGSD DALSVGRRLK VLRTRRGMTR EVLGGLVGRS ASWVKAVETG RLAAPKLSML 
LRLAEALRVR DLAELTGGQS IPVVLFSGPG HDRLTAVRAA VNRLPVSAAD QPAPSVADLR
GRVTWAWRAR HAAPNHREVL GGLLPGLLDD AQRTARAEAD GPQRRAALAV LAEVYALTQF
FVSYQPAQDL VWRVAERGVS TALDSDDLHA VGVAAWLMTQ AHREAGDWDA ADVVASQATA
LLRDSLSSDD ATDDVAALWG ALQFESGYTA ARRGEIGNAW RYWDAADAVA RRLPDDYFHP
VTSFSQTVMH AHAVTVAVEL RQSGEGVRQA ERWRAAVIPS HPRQARHWIE QARAYQIDRK
YDEALRLLDH AYDSAPETIR YNGHARRIIL EELDARDGRR REQASELARK VGLLGV
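
The protein sequence can be checked against the gene sequence by translating it codon by codon. The following is a minimal sketch, not the method used to produce this record: it uses the standard amino-acid assignments, which translation table 11 shares with the standard code (table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG). The names `translate` and `strip_layout` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_layout(seq: str) -> str:
    """Remove the spaces and line breaks used to display the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = strip_layout(seq)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First line of the gene sequence, copied from the record.
first_line = (
    "ATGGCGATCG TTGGCGGGAC GGGTTCGGAT GACGCGCTGT CCGTCGGTCG ACGGCTGAAG"
)
print(translate(first_line))  # MAIVGGTGSDDALSVGRRLK
```

The output matches the first 20 residues of the protein sequence. Passing the full gene sequence to `translate` would allow the same comparison over all 416 residues.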