Gene Francci3_3493 details


Gene Information

Locus tag: Francci3_3493
Symbol:
ID: 3905227
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4164490
End bp: 4166271
Gene Length: 1782 bp
Protein Length: 593 aa
Translation table: 11
GC content: 76%
IMG OID: 637880815
Product: hypothetical protein
Protein accession: YP_482575
Protein GI: 86742175
COG category:
COG ID:
TIGRFAM ID:
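
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon (the gene sequence in this record ends in TGA).

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(4164490, 4166271)
print(length)                  # 1782
print(protein_length(length))  # 593
```

Both results match the Gene Length (1782 bp) and Protein Length (593 aa) fields of this record.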


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value: 0.569085
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.623297
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCGAGC TGCGGGCGGT CGCACTCAGC GAGGACGGCG GCTACCTCGT GCTCACCGAC 
GCGAACGGCA GGACGGACGG CGAGCAGTTC CGGGTGCCCG TGGACGACCG GCTGCGCGCG
GCCCTGCGCG GTGTGCGTCG CAGCGAGGTG CGTACCGAGA GCGCGCTCAC CCCACGGGAG
ATCCAGGCCA GGCTGCGCGC CGGGGAGACC GCGGCGGAGG TGGCCAGGGC CGCCGGGATC
CCGGTGGAAC GGGTGGAGCG CTATGAGGGT CCCGTCCTCG CCGAACGTGC CAGGGTCGTG
CAGGAGGCGC GGGCCGCGCT GCTCCCCAAG GATCCGGGGG GCGTGCCGGG ACGGCCGTTG
GGCGAGGTCG TCGACGCGCG GCTGATGGGA GCCCAGGACA ATCCGGCCGC GGCGCAGTGG
GACGCGTGGC GCCGCGTCGA CGGCATCTGG CTCGTCCAGC TCACCTCGGA AAGCCGCTGC
GCCCGCTGGA CGTGGGATCC GGTGGTGCGC CGGGTGCGCC CACATGACGA CGCGGCCCGC
ACTCTTGTGG CGCCGGAGAC GGCAGAGCAG CCGGCGCCGG TGCCCGCACC GTCATCCGTA
CCATCATCGT CGGCGGCGCT GCGGGCGGCG GGGCCAGCGC TCACCCTTGT GCACGACCAG
GGCATCCCCG CCACCCCGGC CACCCCGGTC CAGCCGATGG TCGCTGCGAT GGCGGAATCG
GGCCTGCCGC CGGGAACACC GGCCCCGCAG CACGGCCTCC CCGAGCGACA GTACCCTCCG
CCCGCGGCCC TGCCCTCCCA CCCGGCCGCG TCGTATCCCG GGGATAACGC ATATCCCGGG
GATAACGCGT ATCCCGGGGA CACCGCGGGC GGTGGGATCG CGCCCGGGTC GCCCGGGTCG
CCCGGGTCGC CCACCCGTCC GGAGACGATC CAGGCGGCCG AACCGCAACC CGTGGTCGCG
GACACCCGAT CCGCGGACAC CCGAGCCGTG ACCCCCTGGT CCGTACCTGC CCGGTCCGCA
GACCCAGCGG CCCCGGGAGA CCCGCGCGCG GTGGAGCGGA AGGCCGCCCG GGCCGAGGAA
GCCGAGTCCC TGGATCGTCC CGTCCGGCCC GAACCGATCC GCGGCGAGGT CGCTCGCCGG
ACGGCCGCGA CGGGGCGAGG CGTTCCCGGC AGCCGCCCCA GGGTGGGGTC CGCGTCCGGG
ACCGGACGAT CAGGAGTGCA TTCCCGACCG GCCGTCGCGC CGGCCGGTCC GCAGCCCGTG
ACATCCGCCA CGTCTCCCAC CGGGGCAGCC GCCAAGCTGC CGGTCCCGGG CTCGGCGCAG
GTGCCTCCCG TCGCCGGCGG GGAGCGGGGC GGGACAGCCG CCGCGGCCCC GGCCGGACCG
GAGCCGTCGG CCCCAGACGT CGGTACGGCA GCGCGATCCG CTCCCGCCGA ACCGGCCGCG
CTAGCGGGCA CCGATGCCGC CGCAGCCGCC GCGGCCGCCG TGACCTCGGC ACCAGCCGCC
CGAGCCGCCG GATCCGGCAC AGATTCCACG CCCGGGGCGC GCCGCACGGC AGCCCGGCGG
GAAGCCCGGC GGGAAGCCCC CGCCAAGGCT CCCTCCGCGC AGACGGAGGA TGCGCCGGAC
GGGACCGACG AGATGACCGT AGACACATCG AGCGCCTCCG CTGCGTCACG GGCACGCCCG
AGTAACACCG CCCGTGGGAA CGAACGTCCG GCAGGCGGTC GACGCGGACG CAAGTCAGTG
CCAGCATGGG ACGACATCGT CTTCGGTGCC CGACGTCCCT GA
 
Protein sequence
MRELRAVALS EDGGYLVLTD ANGRTDGEQF RVPVDDRLRA ALRGVRRSEV RTESALTPRE 
IQARLRAGET AAEVARAAGI PVERVERYEG PVLAERARVV QEARAALLPK DPGGVPGRPL
GEVVDARLMG AQDNPAAAQW DAWRRVDGIW LVQLTSESRC ARWTWDPVVR RVRPHDDAAR
TLVAPETAEQ PAPVPAPSSV PSSSAALRAA GPALTLVHDQ GIPATPATPV QPMVAAMAES
GLPPGTPAPQ HGLPERQYPP PAALPSHPAA SYPGDNAYPG DNAYPGDTAG GGIAPGSPGS
PGSPTRPETI QAAEPQPVVA DTRSADTRAV TPWSVPARSA DPAAPGDPRA VERKAARAEE
AESLDRPVRP EPIRGEVARR TAATGRGVPG SRPRVGSASG TGRSGVHSRP AVAPAGPQPV
TSATSPTGAA AKLPVPGSAQ VPPVAGGERG GTAAAAPAGP EPSAPDVGTA ARSAPAEPAA
LAGTDAAAAA AAAVTSAPAA RAAGSGTDST PGARRTAARR EARREAPAKA PSAQTEDAPD
GTDEMTVDTS SASAASRARP SNTARGNERP AGGRRGRKSV PAWDDIVFGA RRP
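
Both sequences above are printed in space-separated blocks of ten characters, so they need to be rejoined before any downstream use. The following is a minimal sketch, with hypothetical helper names, of how the blocks might be joined and how a GC fraction could be computed from the gene sequence. It is not the method the database used to derive its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing, line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in a DNA string, ignoring whitespace."""
    seq = clean_sequence(dna)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First ten-base block of the gene sequence in this record.
first_block = "ATGCGCGAGC"
print(gc_fraction(first_block))  # 0.7
```

Passing the full gene sequence text to `gc_fraction` would give the value for the whole gene, and `len(clean_sequence(...))` on either sequence can be compared with the length fields in the Gene Information section.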