Gene Francci3_2257 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2257 
Symbol 
ID3905025 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2634954 
End bp2636708 
Gene Length1755 bp 
Protein Length584 aa 
Translation table11 
GC content75% 
IMG OID637879588 
Producthypothetical protein 
Protein accessionYP_481354 
Protein GI86740954 
COG category[S] Function unknown 
COG ID[COG2187] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.197929 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00613294 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGACCATCC AGTACGTGCG GGATGCGACG CCCTCCACCG GGGCGGACGA CCATCACGCG 
GCCGGAGGGA TGTTGTCCCC GACGAACCCG CCGTCCCCCG CCTCAACCCC GGCCCAGCCC
CGGCTGCGCG CCCTGTACCT GTCCGAGCTG GAGACGTTCG AGACCTACGA GACGCACAGC
GCCACGCTGC ACCTGGCCGC CGATCGCGTG TACAAGCGCA AGAAGCCGGT GAACCTGGGC
TTCCTCGACT TCACCGACCG CCGCACCCGG GAGTCGGTCT GCCGGTCGGA GGTCGCGCTC
AACCGCCGGC TGGCCCCCGA CGTCTACCTG GGTGTCGCCG ATCTCCTCGA CGACACCGGT
GAGGTGATCG ACCACCTGGT CGTGATGCGG CGGATGCCGG CGAGCCGGCG GCTGTCCACC
CTCGTCCGCC GGCGCAGCCG GGTCGGCCCG GCACTGCGCA CGGTCGCCCG GGCGCTGGCG
GTGTTCCACC AGCGGTGCGA GACCTCACCG GAGATCGCGG TGGCGGGGCA GCGGGCGACC
CTGGAGGGGC TGTGGCGGGA GGGCCTGGAA GGCATCTCCC CCTACCGCGG CACCCTGCTG
GACGCGGCGG TGGTCGACGA GATCGGCGAA CTGGCGCTGC GCTACCTGGC CGGCCGGGAG
ACCCTGCTCG GCGATCGGGT GCGCGCCGGG TGGATCCGCG ACGGGCACGG CGACCTGCTC
GCCGACGACA TCTACTGCCT CGGCGACGGA CCCCGTATCC TCGACTGCAT CGAGTTCGAC
CCGCGGCTGC GCTTCGGTGA CGTCCTCGGC GACGTCGCGT TCCTGGCGAT GGATCTGGAA
CGCCTCGGCG CGCCGGAGGA GGCCGCCGAG TTCCTCGACG CCTACCGGGA GTTCAGCGGC
GAGGTGCACC CGCGGTCGTT GCAGCATCTC TACGTCGCCT ACCGGGCGTT CGTCCGGGCG
AAGGTGACCT GTATCCGGGG CGGGCAGGGT GATCCCGACG CGGCCGAGGA GGCCCGCCGG
CTGCTGGCCG TCGCCCACCG TCATCTGCGG GCTGGCCGGG TCCAGCTCGT CGTGGTCGGC
GGGCTGCCCG GGACGGGCAA GACGACCCTG GCCGGCCGGC TGGCCGGGGT CGGTGACGGC
TGGGTGCTGC TGCGCTCCGA CGTGATCCGC CAGGAGCTGA CCGGGATGCC CCTGCGTGAG
GGCGGGCCGG CCGCGGACAC CACCGCCGGC GGGTATGCCA GTGCCCTGCG CAACGCCAGC
GGCACCGCCA CGAGAACCGG GGCCCGCCGC GACGCCGGTA CCGGCGCGGC CGCGACCTCC
GACCCCGCGA CCTCCGACCC CGCGGACGGC GACCCCGCGA CCTCCGACCC GCGGTTCGGC
ACCGGGCGCT ACGCCCCCGA GATCACCGAC GCGACGTACG CCGAGATGCT GCGCCGCGCC
GAGGCGGCTC TCGCCCGCGG GGAACGGGTG GTGCTGGACG CATCCTGGTC GAGCGCGCGT
CACCGCCGGG CCGCCGCCGA GCTCGCCGCA AGCGTCTGCG CCGACCTGGT GGAGCTGCAC
TGCGTGACGG CACCGGAGGT GGCGGCCGCC CGGATCGGGC GCCGCGCCGC CGCGGGCACC
GACCCGTCGG AGGCGACGAT GGCCATCCAC CGGGCGATGG CTGCCCGTGC CGACCCCTGG
CCGTCGGCGA CGGTGGTACG CACCGCCGTC CCGGTCGCCG AGGCCCTGCA GACGGTCCTC
GCCCACCTCG ACTGA
 
Protein sequence
MTIQYVRDAT PSTGADDHHA AGGMLSPTNP PSPASTPAQP RLRALYLSEL ETFETYETHS 
ATLHLAADRV YKRKKPVNLG FLDFTDRRTR ESVCRSEVAL NRRLAPDVYL GVADLLDDTG
EVIDHLVVMR RMPASRRLST LVRRRSRVGP ALRTVARALA VFHQRCETSP EIAVAGQRAT
LEGLWREGLE GISPYRGTLL DAAVVDEIGE LALRYLAGRE TLLGDRVRAG WIRDGHGDLL
ADDIYCLGDG PRILDCIEFD PRLRFGDVLG DVAFLAMDLE RLGAPEEAAE FLDAYREFSG
EVHPRSLQHL YVAYRAFVRA KVTCIRGGQG DPDAAEEARR LLAVAHRHLR AGRVQLVVVG
GLPGTGKTTL AGRLAGVGDG WVLLRSDVIR QELTGMPLRE GGPAADTTAG GYASALRNAS
GTATRTGARR DAGTGAAATS DPATSDPADG DPATSDPRFG TGRYAPEITD ATYAEMLRRA
EAALARGERV VLDASWSSAR HRRAAAELAA SVCADLVELH CVTAPEVAAA RIGRRAAAGT
DPSEATMAIH RAMAARADPW PSATVVRTAV PVAEALQTVL AHLD