Gene Francci3_1829 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1829 
Symbol 
ID3906220 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2166148 
End bp2167707 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content73% 
IMG OID637879167 
Producthypothetical protein 
Protein accessionYP_480934 
Protein GI86740534 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.790785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones30 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGGT CCAGCCGGCG GGCGGAACAG GACGAGCTGC GGGCGCGGAT GCGCGCGGCC 
GGCATGTCCC ACGACGAAAT CGCCGTCGAG TTCGCCCGCC GCTACGGCTA CCGCCCCCGC
GCCGCGCACC GCCACGCGCA CGGCTGGTCC CAAGTACAAG CCGCCGGACA CATCAACAGC
CACGCCGCCC AGGTCGGCCT CGACCCCCGT GGCGCCGCCC CGATGGACGG CCCCCGCCTG
TCCGAGCTGG AGAACTGGCC GCTCCCCAAC AACCGCCGCC GGCCCACCCC CCAGCTCCTC
GCGCTGCTCG CCGAGGTCTA CGGCACCAAC ATCCACAACC TCATCGACCT CGACGACCGC
GAACACATGC CCCCAGCAGA CATGCTGGTC ATCACCACCA CACGCCAGGA CGCTCGGCCA
ACGCCGCCGA CGGGATCACC GGTGAACCTT GCACTGCCCG CGGCCCCGTG TTTCAGCCCC
ACGGCCCCGT GCGTCCAGTC GGGACGCGGT CATGGTGCGG CCAGGGGGCT GCGTCTCGAC
GGGCCTGCTG GAAGCATGGA GTGGGTGGAC GCTCTCGGCC GCCGCGGGTT CACTCTTCTC
GCGGGATCAG CAGTAGTGGC GGGTCTGGCC GGCGCCGGCC GCGCCCGCCA CGTCGACCCG
GCGCTCATCT CCTACTTCGA CGGCCAGCTC CAGGGCCACT ACCACGCGGA CATGCTGCTC
GGCGCCGGCG CCCTGATCGG CACGGTCGCC TCCCAATACG AGGTCATCAC GCAACTGCTG
GATGCCGCGG ACGGGTCGAC CCGCCGGGGC CTGGCGAAGG TCGGCTCCTC GTTCGCCGCG
TTCGCGGCCT GGCTGTGGCT CGACGCGGGA GACGCGGTCG CCGCGATGCG CTGCCACGAC
GCGGCCTTGG AACTGGCCCA CCGCTGCGGG GAACGCGACG CCGTCGCCTG CGCGCTGGTC
GATCGGGCGA TGGCCTTCAC CGACCTGGGA AACGCGGCGG CCGTGATCGA CCTGTGCCAG
GCCGCGCTGG GTGACGCCCG GCAGCTGGCA CCCGAAGTCC AGGTGTTCGC CTTGCAGCAG
CAGGCACACG GCGCGTCGCT GCGCGGTGAC CGTCGCCAGG TTGATCTCCT CCTCGATCAG
GCCGGCCGGC TCGTGGAGCA CGTCGAGGTC GAGGAATACG GCACGGCCTG CCGCCGCACC
GACGGCTACG TCGAAGTGCA GCGTGCAACC TGCTATGGAC GGCTGGGTCT CGCCGACGAC
GCTGACCGCC TCTGGCAGCA GATCATCCCC GCTGCCCCGG CCTCTGCCCG CCGGGACGTC
GGGGTCTGGT CAGCGCGCCA CGCGGTCGCC GCCGCGCGGC AGGGAGAACC AGAGCGGGCG
GTAGACCTCA CGCGCCAGGC CAGCGCGCTC GCGGTGGAGA CCGGCTCCGC GCGGGCCCGG
CGAGAACTGG CCGCGGTCGC GGCGGCGATG GCTCCGTGGC GCGCGCACCC CCTCGGCCAG
GAGCTCGCCG ACGTGCTCAC ACCCTTCACC ACCGACGAGA CCGGGAGAGA GCATGGCTGA
 
Protein sequence
MSRSSRRAEQ DELRARMRAA GMSHDEIAVE FARRYGYRPR AAHRHAHGWS QVQAAGHINS 
HAAQVGLDPR GAAPMDGPRL SELENWPLPN NRRRPTPQLL ALLAEVYGTN IHNLIDLDDR
EHMPPADMLV ITTTRQDARP TPPTGSPVNL ALPAAPCFSP TAPCVQSGRG HGAARGLRLD
GPAGSMEWVD ALGRRGFTLL AGSAVVAGLA GAGRARHVDP ALISYFDGQL QGHYHADMLL
GAGALIGTVA SQYEVITQLL DAADGSTRRG LAKVGSSFAA FAAWLWLDAG DAVAAMRCHD
AALELAHRCG ERDAVACALV DRAMAFTDLG NAAAVIDLCQ AALGDARQLA PEVQVFALQQ
QAHGASLRGD RRQVDLLLDQ AGRLVEHVEV EEYGTACRRT DGYVEVQRAT CYGRLGLADD
ADRLWQQIIP AAPASARRDV GVWSARHAVA AARQGEPERA VDLTRQASAL AVETGSARAR
RELAAVAAAM APWRAHPLGQ ELADVLTPFT TDETGREHG