Gene Francci3_2023 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2023 
Symbol 
ID3906739 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2378246 
End bp2379649 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content73% 
IMG OID637879359 
Producthypothetical protein 
Protein accessionYP_481126 
Protein GI86740726 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.898641 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGGCAGCC GGAACCGGGA ACGACGCAAG GCAAAACAGA AGGCCCGCGC GAACCGTGTC 
CGCGCCCAGG CGCCGGAGTC CGGGGCAGGG GCCTTCTACC AGGAGGAGCG CGGCCAGCGG
GAGATGTTTC GCCGGATCGC CGATCAGCTG GTCGCGGCGG CGCTGAACGC CCAGCTCGCC
CGGGATGAGG CGGCACTGGC CGAGTACGTG GAGTTGCTCG TCGCCGCGCC GGGCGGGCCG
GCCGGACGCC GAGTTGTCAA TCGGTCCCTG GCCGGGTGGT TCGACCGCAC CGTCGAGGCC
GCCTGGCAGC GCGGCTGGCA GCCGGCTGAC GTGCACCGGA TCGTGAACCG TCAGGCAGGC
CAGCGCCAGG CGCGACTCGC CGTCGACGCC ATCGCCGGCC AGATGCGGCA ATACGCGGCC
GCCACCGTCG ACGAGCGGTG GAAGAACCAG CTGTATGATC TTAACGCCGT TCTGTGGTGG
GAGGATGACG GCACCTGGCT GGACGCCTGG GGTGACCGGG AGGGCCGGGA CCGGGCGGAG
GCGCTTGCCG ACACCCTTGG TCTGCTCACC CTTCTGCACA CCCTTCCCGC GATCGAATCC
CTGTGCCCCC CGCCAGGCAC CACACGACGC GACCCACCAG GTCGCCCCGC GGGACGCGGC
CACGGCCAGG CGGACCCGGG CCACCGGGCC GGGGCCGGCC GGAGCGCCGA CCCACGCATC
CTCGACAAGG TCCGGGCGCT GCTGGCGAAG GCCGAATCAA CCGGGTTCGC GGACGAGGCT
GAGGCGCTGA CCGCCAAGGC ACAGCAGCTC ATGGCCCGGC ACAGCATCGA CGAGGCGCTG
CTCGCGGCGC GGGAGGGAAC CCGCGACGAG CCGGCCGGCC GCCGGGTCGG CGTCGACAGC
CCTTACGAGG CGGCCAAGGC CAGCCTGCTC GACGTGGTCG CCGGTGCAAA CCGATGCCGT
TCGGTGTGGA CAAAGAACCT CGGGTTCGCC ACGGTGATCG GTTTCCAGCC TGACCTCGAC
GCCGTCGAAC TGCTGTACAC CTCGCTCCTG GTCCAGGCGA CCGCGGCGAT GATGCAGGCC
GGGTCCCGCC ACGGGCGGTC CCGCACCCGG TCGTTCCGCC AGTCGTTCCT CGCCTCGTTC
GCGGTCCGGA TCGGCCAACG CCTGACGGCC GCTACCGAGC AGGCCAGTGA ACAGGCCGCG
GTCGAGGCGG GCGAGAGCCG GCTGCTGCCT GTGCTCGCCG CCCGTGGCGA CGCCGTGAAG
GAGGCAGCCG AGACGATGTT CCCGCAGGTC GTCGCCCGGG CGGTGAACGC GACCGACGGC
GAGGGGTGGG CGTCCGGCCG GGCCGCCGCT GACCTCGCCT CCCTGCACAC CTACGGCGAG
GTGACCACCG CCCGGTCCCG ATAG
 
Protein sequence
MGSRNRERRK AKQKARANRV RAQAPESGAG AFYQEERGQR EMFRRIADQL VAAALNAQLA 
RDEAALAEYV ELLVAAPGGP AGRRVVNRSL AGWFDRTVEA AWQRGWQPAD VHRIVNRQAG
QRQARLAVDA IAGQMRQYAA ATVDERWKNQ LYDLNAVLWW EDDGTWLDAW GDREGRDRAE
ALADTLGLLT LLHTLPAIES LCPPPGTTRR DPPGRPAGRG HGQADPGHRA GAGRSADPRI
LDKVRALLAK AESTGFADEA EALTAKAQQL MARHSIDEAL LAAREGTRDE PAGRRVGVDS
PYEAAKASLL DVVAGANRCR SVWTKNLGFA TVIGFQPDLD AVELLYTSLL VQATAAMMQA
GSRHGRSRTR SFRQSFLASF AVRIGQRLTA ATEQASEQAA VEAGESRLLP VLAARGDAVK
EAAETMFPQV VARAVNATDG EGWASGRAAA DLASLHTYGE VTTARSR