Gene Francci3_0259 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0259 
Symbol 
ID3903667 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp299713 
End bp301281 
Gene Length1569 bp 
Protein Length522 aa 
Translation table11 
GC content73% 
IMG OID637877587 
ProductCHAD 
Protein accessionYP_479376 
Protein GI86738976 
COG category[S] Function unknown 
COG ID[COG5607] Uncharacterized conserved protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones24 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.822007 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTCAACG GAATCCGCGA GATCGAGCGC AAGTTCTCGG TCGAACCGAC CTTTGTGCTC 
CCGAAGTTGG GGGAGGTTGC GGGCGTCGCC ACCGCGCGCA CCCGCAAGAC CGTCAGCCTG
GAGGCCGTCT ACTACGACAG CGACGATCTC CGATTGGCCC GTAACAAGAT CACGATACGG
CGTCGGACCG GTGGGGCGGA CGCCGGATGG CACCTCAAGC TGCCCGTCCG GGTGGGGGAG
CGGGACGAGC TCCAGCTCCC GCTCGACGCG GGCGTGGGCG TGGGCGTGGG GGCGGAGGGC
CCGCGGTACA GCCCTCCCGC GGAGTTCGTC GATCTGGTGT CGGTCCATCT GCGCGGCGCC
GAGCCGCGGC CGGTCGCCAG GCTGCGGACA CTGCGGACAG CGCGCCGACT GCGGGACACG
GTCGGAGTCG ACCTGGCCGA GGTCGTCGAT GACCAGGTCT CCGCGCAGAC CCTGGGGGAG
ACGACGGTAC TGAGCAGCTG GCGGGAGATC GAGGTCGAAC TCGTCAACGG CGGGCCCGAG
GTGCTCGACG AGGTCGCCGG TCTGCTCACG GCCGCGGGCG CCACCCCGGC GGCGGACTCC
TCGAAGCTCG CGCGGGTCCT GGGTGAGGCG CTGGCCGCCG GCCCCGGGCC GGACGTTCCT
TCGCCGCCGC GCAAGCCACG ACGCGGGACG CCGGCGGGCG AGGTGGTACG GGCCTACCTG
ATCGAACAGG CACGTGCCCT GCTCGCGGCC GATCCGCGGG TGCGGCTCGA CGAGCCCGAG
GCGGTCCACA AGATGCGCGT CGCCTGCCGT CGGGCCCGCA GCACCCTGCG GACGTTCGCG
CCGCTGTTCC CGCCCGAGAG AGCGCTCTTC CTGGACGGCG AGCTGCGGGA CCTCGCCGGC
GCGCTCTCCG GCGCCCGCGA CGCCGAGGTC CAGGCCGCCT ATTTCGAGAC CCGCCTGGCG
GAGCTGCCCA CCGAGCTGGT CGCGGGGCCC GTGCGCAAGA CGGTCACCGC GCACCTCGGC
GCCGGCACGG CCAACGGCCG GGCGGAGGCG TTGGCCATGT TGCGCAGTGA CCGGTACTTC
GCGCTCGTCT CCAACCTGCT CACCCTACTG CGGGGCCCGC TCACCCCCGC GGCGGCCCGT
CCGGCCGGCA AGGCCCTCCC CGATCTGCTG CTCGGCGCCG ACCGGAAGCT GGCGAAGAAG
GTCCGTGCCG CGAGCGCCCT GAAGGCCGGC TCGGAACGGG ACGAACTGCT GCATTCCGCC
CGCAAGCAGG CCAAGCGGTT ACGGTACGCG GCGGAAGCCG TCGCACCGCT GTACGGGAAT
GACGCGGCGC GGCTGGTCGA GCAGGCCCAG ATCGCGCAGG AGCTGCTCGG AACCCATCAG
GACGCCACCA TCGCGCGCAG GCTGCTGGGG GACTGGGGGA CGGCAGCGCA GGCCCAGGGC
GCCCCCACCG CGTTTACTCT GGGTGTCCTG CTGGGCCTGG AGGAGTGCCG GGCACGCATG
GCGGAACGAG ACTTCTTCGA TGCGTGGCCC GCGATCTCGG CAGCCCGGCA CCGTCGCTGG
ATCCGCTGA
 
Protein sequence
MVNGIREIER KFSVEPTFVL PKLGEVAGVA TARTRKTVSL EAVYYDSDDL RLARNKITIR 
RRTGGADAGW HLKLPVRVGE RDELQLPLDA GVGVGVGAEG PRYSPPAEFV DLVSVHLRGA
EPRPVARLRT LRTARRLRDT VGVDLAEVVD DQVSAQTLGE TTVLSSWREI EVELVNGGPE
VLDEVAGLLT AAGATPAADS SKLARVLGEA LAAGPGPDVP SPPRKPRRGT PAGEVVRAYL
IEQARALLAA DPRVRLDEPE AVHKMRVACR RARSTLRTFA PLFPPERALF LDGELRDLAG
ALSGARDAEV QAAYFETRLA ELPTELVAGP VRKTVTAHLG AGTANGRAEA LAMLRSDRYF
ALVSNLLTLL RGPLTPAAAR PAGKALPDLL LGADRKLAKK VRAASALKAG SERDELLHSA
RKQAKRLRYA AEAVAPLYGN DAARLVEQAQ IAQELLGTHQ DATIARRLLG DWGTAAQAQG
APTAFTLGVL LGLEECRARM AERDFFDAWP AISAARHRRW IR