Gene Francci3_1123 details


Gene Information

Locus tag: Francci3_1123
Symbol:
ID: 3905465
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1335454
End bp: 1336731
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 67%
IMG OID: 637878455
Product: hypothetical protein
Protein accession: YP_480232
Protein GI: 86739832
COG category:
COG ID:
TIGRFAM ID:
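
The coordinate and length fields above are mutually consistent, which can be checked with a few lines of arithmetic. The sketch below is illustrative only: the helper names are hypothetical, and it assumes the start and end positions are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def expected_protein_length(gene_length: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1  # drop the terminal stop codon


# Values taken from the record above.
length = gene_length_bp(1335454, 1336731)
print(length, expected_protein_length(length))
```

With the values in this record, the computed gene length is 1278 bp and the expected protein length is 425 aa, matching the Gene Length and Protein Length fields.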


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCACGC GGCGGGGGCT ATTCGCCGAG TTGCAGTACC AGGCAGCGCA GGCCGAGAAG 
CGGCAGCGGC AGCAGAGGGC TGCGGCCCAC CGTGCACTCC TTGCCGCTGA AAAGGAAGCC
GCGCAGAAGG CTCGGGCCGC TGAGCGCGCA GCCGCCGCGG CGGCCAAGGC GTCCACGAAA
GAACAGGCCC GCTTGCTGAA GGAAGCGGGC CTGCTGTACG TCGCGGCGCG ATTGAGCGAG
GTCGGGTCGC TGAATGCCGA TCTCGCGAGC ACATTCGAAG AGATCGACGG CATTCTGGCC
ACGGCGCTCC TGGTCGACAG CTATGTTGAT CTCGAGGCCT TGAAAGTCAC GACCGTCGTG
CATCCACCGT TCGAGCCCGG TGCTCTCGCA GTCCCGACAC CGCCCGTTGC TGCTCCTGTG
TACCCGGCGG AACCCGTCTA CCAGGAGCCG CAAGTACCGG GAGTCCTATT CGGGGCGAAG
AAAAAACATG CCCAGGCGAT CGCTCAAGCT CAGACCACGC ACGAGCAGGC GCTGCGGCGG
TGGCGGGAAC AGGTGTCAGC GATACGGACG GCGCATGTCT CCGCTCTGGA GCAGCGGCAG
CGTGCCGAGG ACGCTCGCCT GGCGAAGCTC GCCGCGGCGC GGGCGGTCCA CGTCGAAGCG
TGTCGTCGGA GAGACGCCGA CGCCGACGAG CGGAACCGGG GACTCACCAG ACTGATCAAT
GATCTCGCCT TCGACGTCGA GGCGGCGATC CGCGAGTACG TCGGGATCGT CCTGTCGAAC
TCCGCCTACC CGGATGCATT TCCAGTCACT CACGACTACG AGTTCGATCT CTCCAGCAGG
GAGCTGCGTC TGGCAGCCGC TGTCCCTGAA CCATCCGCAG TCCCCTCGGT CAAGGAGTAC
AAGTACGCCC CGAGGAAGGA CGAGATCTCG TCGACCAAGC TGCCGGCGAC GGTCCAGAAG
GACCGCTACG CGAGCGCCGT CTTCCAGACC GCCGTACGCA CGCTGCACGA CGTCTTCGGC
GCCGACCGCC AAGGAAAGAT CCACTCGATT GCCCTGACGG TCGGGGTCGA CCGGATCTCG
CCTGCGACCG GGCTTCCGGA GACCATCCCG CTCGCGATCG TCGCTGCAGA CCGCGCAACG
TTCCGCAAGT TCCGGCTCGA CCAGGACGAG ATCGTCCCTC AGAAGACCCT CGAGTACCTC
GGCGCGGCGC TGTCCCCTTC GCCGTTCACT CTCAAGCCCG CCGACGCCTC CCGCGGCATC
AGGCAGCGTG GGCAGTGA
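
The GC content field can be recomputed from a sequence laid out like the one above, in space-separated blocks of 10 bases. This is a minimal sketch with a hypothetical helper name; it assumes the sequence contains only A, C, G and T, and how the database rounds the reported percentage is not stated in the record.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace and line breaks are ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# First block of the gene sequence above, as a small example.
print(gc_percent("GTGGCCACGC"))
```

Pasting the full gene sequence into `gc_percent` gives the value to compare against the GC content field.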
 
Protein sequence
MATRRGLFAE LQYQAAQAEK RQRQQRAAAH RALLAAEKEA AQKARAAERA AAAAAKASTK 
EQARLLKEAG LLYVAARLSE VGSLNADLAS TFEEIDGILA TALLVDSYVD LEALKVTTVV
HPPFEPGALA VPTPPVAAPV YPAEPVYQEP QVPGVLFGAK KKHAQAIAQA QTTHEQALRR
WREQVSAIRT AHVSALEQRQ RAEDARLAKL AAARAVHVEA CRRRDADADE RNRGLTRLIN
DLAFDVEAAI REYVGIVLSN SAYPDAFPVT HDYEFDLSSR ELRLAAAVPE PSAVPSVKEY
KYAPRKDEIS STKLPATVQK DRYASAVFQT AVRTLHDVFG ADRQGKIHSI ALTVGVDRIS
PATGLPETIP LAIVAADRAT FRKFRLDQDE IVPQKTLEYL GAALSPSPFT LKPADASRGI
RQRGQ
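
The protein sequence begins with M although the gene sequence begins with GTG, which normally encodes valine. In translation table 11 (bacterial, archaeal and plant plastid code), GTG is one of the permitted alternative start codons, and an initiator codon is translated as methionine. The sketch below illustrates this; the function and constant names are hypothetical, the codon table is built from the standard amino acid string in T, C, A, G order, and the input is assumed to be a complete in-frame CDS with no ambiguous bases.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Initiator codons allowed by translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(seq), 3):
        codon = seq[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate_cds("GTGGCCACGC GGCGGGGGCT ATTCGCCGAG"))
```

For the first 30 bases this yields MATRRGLFAE, the first ten residues of the protein sequence above.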