Gene Francci3_3471 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3471 
Symbol 
ID3905205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4137224 
End bp4139152 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content72% 
IMG OID637880793 
Producthypothetical protein 
Protein accessionYP_482553 
Protein GI86742153 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.515969 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.494925 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGTGGGC GGGACGTGGG TGGGCGGGAC GTGGGTGGGC GGGACGTGGG TGGAGGTGCT 
GAGGCCAGGG CGGGGCAGCA CCGCGTCGGC CGTGGTCGCC GGCGTCACCG TGGTCGCCGA
GCCCGGTACC GCCGGCTGGC CGCCGGGGTC GCGACGGCCC TCGTTGTGCT GGTGGCGATG
ACCGCCTGGG TGGTCGTGCG CGGAGTGCTG GCCAAGTCCC GTCTCGACGA GGCCAGGCAG
CGGATCGCCG TCCTGCAGCG GCAGGTTCTG AGCGGAGACT TTCCCCGCGA GGCCGAGCTG
CGATCGCAGA TCGAGCAGAT CCGGCGGCGG GCCACCGCTG CGCGGGCCCT GACCTCGGAT
CCGGTGTGGT CGGCGTTCGG GCGGCTGCCC GTCGCTGGCT GCCCGATGCG CTCGGCGGCG
ACGCTGATCC GAGAGGTGGA CGCGGCCGCC GGCACCAGCC TGCCGGCCGT GGCCGACCTG
GGTCCCTCCC TCGATCCGCG GGTGCTGCGC CAGCGTATGA CGATCAATGT CCGGGCGCTC
GCCGCGGTCC GCCGCCCGGC CGAGCGATCC TTCAACGCCC TCTCCGCGTT GCGGGCGGCG
GCGGAGAACG TACCGGACTG CGGATGGGCC GGGCGGGTCA GCGGCATCGC CGACGCCCGC
GCCGAGATGA TCGATCGGAG CCGGCGCCTG GCCGGTGCGC TGGACACCGT TGTTCTGGCC
GCCCGGGTCG GTCCCGAGAT GCTCGGCGGG GGCGGTGTCC GTCGCTACCT GCTGATCGTC
CAGAACCCGG CCGAGTCCCG CGCCAACGGG GGGATCATCG GTGGGTTCGG CCTGCTGACC
GCGGAGCACG GGCGGCTGTC CATCGACGGC ATCTCGGGCA ACGGTGCTCT GCCGGGAGGC
CCCACCCAGC AGCGACCGGC GACGGGGCTG CCGGTCCCGT TCGCGGCCCG TTACGGCGCC
TTCTGGCCCG ACCGTATCTG GGCGAATATC AACCTGACCC CCGACTATCC GATGGCCGGC
AGGCTCTACA GCGCGTTCTA CCGGGCCGGC ACGGGCCTCG ACGTCGACGG CACGATCAGC
CTCGACCCCA CGACGTTGTC GTATCTTCTC GCCGCGAGCC GGCCCGCGGT GCTCCCCGAC
GGCACGTCGG TGGCCGCGGG GCACCTGGTC GATCTCGTCG AGTCGCGCGT CTATGGCGAG
ATCATGGACG CGGCCGCCCG CGACCGCTTC TTCGCCCAGG TCGGTCAGGC CGTCTATGCG
GCCGTGGAGT CGGGAGCCGG CGACACGACG AAGCTGGTGA CCGCGCTGGG ACGGGCCGCT
CGCGAGGGTC GGCTGGAAAT ATCCAGCAAC CACGCCGAGG AGCAGCGGAT TCTTTCGTCC
ACGGCGCTGG GTGGTGCGCT GCCGGACGCG CCTGGGCCGT TTCTCGGGGT CGTCACCCAG
AACGCGACGG CGAGCAAGCT GGACTACTGG CTACGGCGGC AGACCACCTA CCGCATGCAG
CGGCAGCCGA ACGGCGCCGG CCTGGCGACG ATCACGATCC GGCTCACCAA CGCCGCCCCC
GGCGGGCTGC CGGCCTATGT GCGCCACCGG CAGGATCTCA AGGATGCCGC TGGGAATCTC
CGGGCGCAGA ACAATCTCTG GCTCTCGGTG TACACGGGCC GAGGCAGCTG GCTCGTCGCT
GCCCGGCTCG ACGGTGTGCC CATCGGCCTC GCCGGCGGTT CCGAGTCCGG GCATCCCGTG
CTCTCCACCT ATCTCACCGT CGATCGCGGC CAGACCCGGA CCCTGGAAAT CAAGGTCCGG
GAGCCGGTAG GCGGCCCGGC ACTCACCGTG CGTCCACAGC CGTTGCCCGT CGCGGAGCGC
CTGGAGGTGC AGGGGCTACC GGTCGTCCCT CCCTGGTCAT CCCAAGGGTC GTCCCAAACC
CAGAACTGA
 
Protein sequence
MGGRDVGGRD VGGRDVGGGA EARAGQHRVG RGRRRHRGRR ARYRRLAAGV ATALVVLVAM 
TAWVVVRGVL AKSRLDEARQ RIAVLQRQVL SGDFPREAEL RSQIEQIRRR ATAARALTSD
PVWSAFGRLP VAGCPMRSAA TLIREVDAAA GTSLPAVADL GPSLDPRVLR QRMTINVRAL
AAVRRPAERS FNALSALRAA AENVPDCGWA GRVSGIADAR AEMIDRSRRL AGALDTVVLA
ARVGPEMLGG GGVRRYLLIV QNPAESRANG GIIGGFGLLT AEHGRLSIDG ISGNGALPGG
PTQQRPATGL PVPFAARYGA FWPDRIWANI NLTPDYPMAG RLYSAFYRAG TGLDVDGTIS
LDPTTLSYLL AASRPAVLPD GTSVAAGHLV DLVESRVYGE IMDAAARDRF FAQVGQAVYA
AVESGAGDTT KLVTALGRAA REGRLEISSN HAEEQRILSS TALGGALPDA PGPFLGVVTQ
NATASKLDYW LRRQTTYRMQ RQPNGAGLAT ITIRLTNAAP GGLPAYVRHR QDLKDAAGNL
RAQNNLWLSV YTGRGSWLVA ARLDGVPIGL AGGSESGHPV LSTYLTVDRG QTRTLEIKVR
EPVGGPALTV RPQPLPVAER LEVQGLPVVP PWSSQGSSQT QN