Gene Francci3_2108 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2108 
Symbol 
ID3905635 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2474326 
End bp2475531 
Gene Length1206 bp 
Protein Length401 aa 
Translation table11 
GC content66% 
IMG OID637879443 
Producthypothetical protein 
Protein accessionYP_481209 
Protein GI86740809 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.0876721 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.445342 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTCGGAG CCGAGTTCGG CAAGATCATG CAGCGGATCG GGGGCCGCCT CATCGTGACC 
GATGGGGAGA AGGCCACGAC CTACGATTTC CGCCGGCAGC GCGACCGTGA CTCATGGCAT
GGCGCGGTGG GGAAGTCCGT CAGTGATTCG GGTTTGAAGT CCGAGCTGGT CAAGCGGAAG
TTGGACGCGA AGCGCGAAGC CCTGGAGTTC CTGGGTGGCC CCGTGGGGTT CGGGTGGTCC
CAGACGATCA CCCGGTCAGG TAAGAAGATC GTGACGGTGT GGTCCGTCGA CGAGGAGCAG
GCCCGCTGGC TACGCGAGGC CGCGCGGCGT ATCCGCGAAG GTGAGGCCGT TCTCAAGGTC
TCGGATGATT TCTACGACCG TGGGCTTCGG ATCCCGCACC GGCGCACGCA CCCGGGCGAC
ACGATGAAGA GCGGCAGCCT GACGCGCGCC AGCCTCTCCG CGATGCTGCG TAACCCGAGG
ATCGCTGGTC TGTTCGCGAC GGGAAACGTT CACACGGGCT GGACCGTGAA GGGCCCGATG
GCGAACTTCC CCGCGATCCT CACCGAGGAG GAGTGGCGGG AGACATGCGC GGCGCTGGAA
GCGGTCACGA CCCGCAAGGG CACGGGTACG GCCGTCAAGC ACACGTTCGC CGGGTACTAC
GTGTGCCACA AGTGCAGGCG TTCCCTGGTC CGGAACTCTC CCCGCGCGTA CGCCCTGTGG
CGGCATCGTC TCGGGAAGAG CCGTGAACAC TTCGAGTGTG ACCAGTCGTT CCACATCAAC
GCCGCCGACG CGGACGACCT GATGACCCGC CTGGTTGACG CCTACCTACG CCGCCGAGAC
TGGGAGAAGA CCGGCGACGT CGCGGACGGT GACGAGCTGA AGGCCGAGCG GACCGAGAAG
GAACGCGAAC TGGCCGATCT TCCCCGCGCG ATCGCCGCCA AGGAGATCAG CCTGCGGCTG
GGTGGCCAGC TCGAGGCCCA GTACGAGACC CGGCTACGGG AGATCGACGC CGAACTGGCC
CGCCGCGCGC GTCTCGTGAC CGTCCTGGAC GGAGCGGAAG CGCTCCGACT CTGGCGCGGA
GGCACCCTCA CGGAGAAACG CCGTGTCCTG TCAACGATCA TGGTGAAGAT CATTGTGGTT
CCCGGGAAGG ATCTTCCGTT GCGGGAACGG CTGGACCCGC AATGGCGCTA TCCCGGACCT
GCCTGA
 
Protein sequence
MVGAEFGKIM QRIGGRLIVT DGEKATTYDF RRQRDRDSWH GAVGKSVSDS GLKSELVKRK 
LDAKREALEF LGGPVGFGWS QTITRSGKKI VTVWSVDEEQ ARWLREAARR IREGEAVLKV
SDDFYDRGLR IPHRRTHPGD TMKSGSLTRA SLSAMLRNPR IAGLFATGNV HTGWTVKGPM
ANFPAILTEE EWRETCAALE AVTTRKGTGT AVKHTFAGYY VCHKCRRSLV RNSPRAYALW
RHRLGKSREH FECDQSFHIN AADADDLMTR LVDAYLRRRD WEKTGDVADG DELKAERTEK
ERELADLPRA IAAKEISLRL GGQLEAQYET RLREIDAELA RRARLVTVLD GAEALRLWRG
GTLTEKRRVL STIMVKIIVV PGKDLPLRER LDPQWRYPGP A