Gene Francci3_1511 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1511 
Symbol 
ID3904977 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1802483 
End bp1804384 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content72% 
IMG OID637878848 
Producthypothetical protein 
Protein accessionYP_480616 
Protein GI86740216 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACGCCT ACGCCCGGCC CGGTCGTCGG CCCGGCGATG CGCCCCCGGT GTGGCGCAGC 
CGCCGGGAGA ACGAGCTGTT CGCCGGTCGC GACGACGAGC TGGAACGCAT CTGGGACGGC
CTGACCCGGC ATCGGCGGGT GGTGCTGGTG CCGGAGGGGG ACCAGTCCGA CATCGGGGAG
ACCGAGCTCG CCGGCGAGTA CCAGCATCGG TTCAAGCTGC GCTACGACGT CTCGTGGTGG
GTGGACTGCT CGACGACCGC CGCCGTCCCC GGGCAGATCG GCGAGCTCTA CGAGCGGGCC
CGCACCGAAC TTCCCGGCCC CCCTCCGGGC GCGGCCGACG CCGGGCCGGA GTCCACCGCG
GGATGGTTGG TCATCTTCGC GGGTGTGGGC AGTCCGGACG AGGTGGCGGA GTTCCTGCCC
GACGGCGAGG CGCATGTGAT CATCATCGCC GACCGCGCCG TGGGCGCCTG GCGGGACCGG
ACGATGCCCA TCGGGCCATT GCGTCGTCGC GAGTCGGTCA TGCTGCTCAC CAGCGCGGCG
CCGATGGTCG ACCCGGCCAC CGCCGCCCAG CTGGGGGAGT TGGTCGGACA CCGGCCCGCC
CTGCTCGCGG AGATCGCCGG GTACCTGATC CGCGAGGCGG TCGTGTCGCC GGAACTGTGT
CGGCGCCTGC TGGAGATGGC GGCGTCGCGG CCGACGCCGG CCGTCCGGGA CGCCGCCGGC
GGTGACCAGT CGTCCCGGGC CCAGGCGATG CGTGGTTCCG GAACCCCCGC CGGGGCCGCG
TCCGGGGCCG CGTCCGGGGC CGCAGCCGGG GCCGGGGCCG CCGTCATGGT CGCAGGCGAG
GTACGTGACC CGGTAGGGCG CAACGCGGCG CGGGCACCAT CGATTCCTGT GATCAACAAG
CACGGCTGGC CCCCCCGGGA GGTCGACGAG CTCGTCGCGG CCCTGATGCG GGTGGAGTAC
ATCGCTGACC TGGCCGGTTT CGACCACTGG TTCGACGAGC TGACACGGAT CCTCGGTCGC
ACGATCGCGC TGACCTCCCC GGTCGTCGCG GTCCGCCTGA CCACGCTGGT CAGCGAGGCG
GTCGGCCAGC CCGATCCCGG CATGCTCGAC GCCTTGCTGC AGGCCCTCGA CCTGGTCGCG
CCGCGCGACG ACCGGTCCGT CGTGGACTTC CGGCGCTTGG TGACCGAGCT GCAATCCCAC
TGGAGCGGCG CGGGCTCGGC GCTGCCCGGC ATGTCATCAC CGGTGCCCGC CTACCAGCTT
CCCTCCTGGC CACCGCTGCT GTCTGGAGTG CCCTCCTCAC CCCCGCCCAG CCACGGGCCG
TCGGGCACCC CGACCTACTA CTTCTTCACC AGCCACGCGC ACCGCGACGA CCGGGATCGC
GTGGCCATCT TCCATCGGGA GCTCGAGCTG GAGCTGCGCC GCAAGGTCCG GCGCCGGATC
CGGCCGACGG GATTCTTCGA CGCCGACCGG CTGGGCGGCG GGGAGCACTG GCCGACCTCG
CTGCGCGACG CGGTGCGCAC GGCTCCGGTG CTGGTCGCGC TGTGGTGTGA CGACTACTTT
GAGAGCGACT GGTGCGGCCG GGAGTTCGGC GTTTTTCAGG AACGTATCCG CCGGGCGACC
AAGCCGGGCG GGAACCCGCC GTCCGGGATC ATTCCCGTGC CCTGGCTGCG GCGGGACGCC
GAGGTACCCG AGGCGGCCCG TGAACTCCAC ATCGCGCATA TGGAGCTTGG TCGTCAGTAT
GACAACCTTC CGGTCCTGGA TTTGATGCGC CATCCCGCCG CCTTCGCGGA GTATGTAAGT
CTGCTGGCCT ACCGGGTCAT GGATGTCGCT CGCGACCAGC TGCCGCCGTT GGACGCCGAG
GTGACGGAGC TCGTCCGTTC CCCGTTCCAC CATCAGCCGT GA
 
Protein sequence
MNAYARPGRR PGDAPPVWRS RRENELFAGR DDELERIWDG LTRHRRVVLV PEGDQSDIGE 
TELAGEYQHR FKLRYDVSWW VDCSTTAAVP GQIGELYERA RTELPGPPPG AADAGPESTA
GWLVIFAGVG SPDEVAEFLP DGEAHVIIIA DRAVGAWRDR TMPIGPLRRR ESVMLLTSAA
PMVDPATAAQ LGELVGHRPA LLAEIAGYLI REAVVSPELC RRLLEMAASR PTPAVRDAAG
GDQSSRAQAM RGSGTPAGAA SGAASGAAAG AGAAVMVAGE VRDPVGRNAA RAPSIPVINK
HGWPPREVDE LVAALMRVEY IADLAGFDHW FDELTRILGR TIALTSPVVA VRLTTLVSEA
VGQPDPGMLD ALLQALDLVA PRDDRSVVDF RRLVTELQSH WSGAGSALPG MSSPVPAYQL
PSWPPLLSGV PSSPPPSHGP SGTPTYYFFT SHAHRDDRDR VAIFHRELEL ELRRKVRRRI
RPTGFFDADR LGGGEHWPTS LRDAVRTAPV LVALWCDDYF ESDWCGREFG VFQERIRRAT
KPGGNPPSGI IPVPWLRRDA EVPEAARELH IAHMELGRQY DNLPVLDLMR HPAAFAEYVS
LLAYRVMDVA RDQLPPLDAE VTELVRSPFH HQP