Gene Francci3_4273 details


Gene Information

Locus tag: Francci3_4273
Symbol:
ID: 3907240
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5101068
End bp: 5102588
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 70%
IMG OID: 637881599
Product: putative plasmid replication initiator protein
Protein accession: YP_483348
Protein GI: 86742948
COG category:
COG ID:
TIGRFAM ID:
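
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch illustrates. It assumes 1-based inclusive coordinates and an unspliced CDS that ends in a single stop codon; the function names `gene_length_bp` and `protein_length_aa` are hypothetical and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS with one terminal stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the stop codon.
    return gene_length // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length_bp(5101068, 5102588)
print(length, protein_length_aa(length))  # 1521 506
```

Under these assumptions the result agrees with the Gene Length and Protein Length fields listed in the record.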


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value: 0.730737
Plasmid hitchhiking: No
Plasmid clonability: normal

Fosmid Coverage information

Num covering fosmid clones: 17
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal

Sequence

Gene sequence
ATGACAGTCC CTGTGCTGGA TTCGGAGTTC CTGCGCGCGG CGATGCCTCG CGCTGCGGAC 
GGCTCCCTGT CCGGGTTTCT CGCGCAACTC GACCGGGTGT CCGGCTGCGC GCACCCTGTC
CGCCTGTTCG GCCACATCGA CCATGTGGAT ACGGCTACCG GTGAGGTGCG CCGCGTGCTG
GACACGGCGG ACATGCCGGA CGGGCTGCTC CTCGTGCCCT GCGGGAACCG GCGGGCCTCG
GTGTGCCCGT CCTGCTCCTA CTTGTACGCG GGTGACGCCT GGCAGATCGT CCACTCCGGT
CTGACCGGTG GCCATGATGT TCCCGCGACG GTGGCGGAGC ATCCGGGGCT GTTCGTCACG
GTGACCGCAC CCTCGTTCGG CCCGGTGCAT TCCCGCCGGG CGAACCACGG TCCCGCTCAG
GTCTGCAACC GGCGCCAAGG ACGATGCCCG CATGGCCGTG CGCGCGGCTG CGGCTTCGTC
CACGGCGAGG ATGACTGGCG GCTCGGCACC CCGCTGTGCC CGGACTGCCA CGACGACGGC
GCGCTGATCG TCTGGAACCG CACCTCGGGC AGGCTATGGA AGCGCACCGT GGACGCGACC
TACCGGCGCA TGGCCACCGT CACGGGCATG ACCGAACGGG GCTTGCGGCG TGCGGTCCGC
ATCTCCTACG TCAAGGTCGG GGAAGCCCAA CGGCGCGGCG CGGTCCACTT CCATGCCCTG
CTGCGCCTGG ACGCGGGCAG CGACTCGGAC GTCTGGCAGG CTCCTCCCGC CTGGGCAACC
GGGGACCTGC TGGCGTCCTG CTGGCGCTGG GCGGTCACCC ACACCGAGGA ACCCTGCCCG
GACCCGCACG CCTATGACCT TCCCTCCGAA GCGGACGCGC TCGCCACCCT GGCGGCTGCG
GGCCTGCTCG CCTCCGCGGA CATCCCCCCA CCTGCGCCAC GCCCGGCACC CATGCGGGCG
GCTCGCTGGG GTGAGCAGCA TGATGTGAAA AAGGTGTTGG TGGACGGCGG GGACCTGACC
CCGCAAGCGG TCGGGAACTA CCTCGCCAAG TACATCACCA AGTCCGTCGT GGACTCCGGC
CTCCTCGACC GGCGTATCCG CGACGCTGAC GACTTGGAGG CACGCCTTCC GTTCCTGGGA
GAGTTCCACG CCCGGCTGGT CCATACCGCG TGGCGTCTCG GTCCGCACCC GGAGTTTCAG
GATCTACGGC TGCGCCAATG GGCGCACCAA CTCGGCTTCA ACGGCCATTG GCTCACCAAA
TCACGCGGCT ACTCCACCAC CTTCGCGGCA CGCCGGACGG CCCGGCGTAC CTGGCGGCGC
ACCCACGTGA ACGGCGAGGA ACGCCCGGCC CTCGACGCCT TCGGCCGACC CGACGACACC
GACGAAACAG TGATCATCCC CACCTGGCGG TACCACTCCT CCGGCTACAC CAACAACGGC
GACCGATGGC TCGCATCCAT GGCCGCAGAC CAAGCCCGGT CACGGAGAGA GGTTACCCGT
GATCATCGTT CGGCGGCGTG A
 
Protein sequence
MTVPVLDSEF LRAAMPRAAD GSLSGFLAQL DRVSGCAHPV RLFGHIDHVD TATGEVRRVL 
DTADMPDGLL LVPCGNRRAS VCPSCSYLYA GDAWQIVHSG LTGGHDVPAT VAEHPGLFVT
VTAPSFGPVH SRRANHGPAQ VCNRRQGRCP HGRARGCGFV HGEDDWRLGT PLCPDCHDDG
ALIVWNRTSG RLWKRTVDAT YRRMATVTGM TERGLRRAVR ISYVKVGEAQ RRGAVHFHAL
LRLDAGSDSD VWQAPPAWAT GDLLASCWRW AVTHTEEPCP DPHAYDLPSE ADALATLAAA
GLLASADIPP PAPRPAPMRA ARWGEQHDVK KVLVDGGDLT PQAVGNYLAK YITKSVVDSG
LLDRRIRDAD DLEARLPFLG EFHARLVHTA WRLGPHPEFQ DLRLRQWAHQ LGFNGHWLTK
SRGYSTTFAA RRTARRTWRR THVNGEERPA LDAFGRPDDT DETVIIPTWR YHSSGYTNNG
DRWLASMAAD QARSRREVTR DHRSAA
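
The gene and protein sequences above can be cross-checked against each other with a short script. The following is a minimal sketch, not the database's own method: it assumes the blocks are pasted as plain strings with spaces between groups of ten, and it uses the standard codon-to-amino-acid mapping, which translation table 11 shares for internal codons. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Codon table built from the conventional TCAG ordering of the standard code.
# Translation table 11 uses the same codon-to-amino-acid assignments; it differs
# only in which codons may act as start codons, which this sketch does not model.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the blocked layout."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence in this record.
first_blocks = "ATGACAGTCC CTGTGCTGGA TTCGGAGTTC"
print(translate(first_blocks))  # MTVPVLDSEF
```

The printed residues match the first block of the protein sequence. Passing the complete gene sequence to `translate` and `gc_percent` allows the same comparison against the full protein sequence and the GC content field.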