Gene Francci3_3458 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3458 
Symbol 
ID3905698 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4122133 
End bp4124082 
Gene Length1950 bp 
Protein Length649 aa 
Translation table11 
GC content67% 
IMG OID637880781 
Producthypothetical protein 
Protein accessionYP_482541 
Protein GI86742141 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4124] Beta-mannanase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCACAG AGAACAAACG GGACGGGACA GCGTCCCAGC CCAGGAGATA TTCGAACGGC 
CGTCACGCGC GGACTCGAGC ACGTCGCCAC CGGATCGGCG GGCTGCTACT CACCCTCACC
GTGGCCCTGA TGACCGGGGC ATTGACCGGA CCTGCCCGCG CGCAGGCCCC GACGATCATC
GAGGACAGCG CCATCGGCAC CGGGCTCAAC CAGGTCTCCT ACACGGGGAC GTGGACCCGG
TGCACCGGGT GCATAGCCGG GCCCCTCGAC GACGGCTTCC GGTACTCGTC GACCTACGGC
AGCGTCGCCT CGTTCCAGTT CATCGGCACG CGGGTGACCA TCTTCGGGGT CAAGGGGCCG
TGGAGCGGCC AGGCCGCGAT CAGCATCGAC GGCAGCACCC CGCAGCTCGT CGACACCTTC
GCTACGGTCG CGGTCGCGAG CTCGATCTTC ACCTCGAACA CGCTCACCGC GGGCACCCAC
ACAGTGCAGA TCACCAACCT GCACCAGCGC AATGCCGCGT CCCGGGGCTA CGACGTCGCC
TTCGACCGCG CCGAGGCGAT CAACGAGGTG GCGCCGCCCC CGCCGACACC GCCGGTGCCC
ACGGCGCTCA CCATCGAGGA CACGACGATC GGCACCGGCA CCAACCAGGT CTCTTACTCG
ACCGGGTGGA CGAAGTGCAC CGGCTGCATT ACCTCCCCGA ACAACAGCTT CTACTACTCG
GCGACCGCCG GCGCGGTCGC CACGATCCGT TTCAGCGGAA CGCAGATCAA CATCTACGGC
GCCAAGGGCC CGATCGGGGG ATTCAGCACG ATCCGCCTGG ACGGCGGCCA GCCGGTCAAC
GTCGACACCT ACGCGCCGAC CTCGAGCATC ACCCTGTTCT ACAGCTCGGA CAAGCTCGCC
GCCGGCACGC ACACCCTGAC CCTGACGAAT ACCGGGCAGC GCAACACCTC CTCGCAGGGC
AACAACGTCG GTTTCGACCG GGCGGAGGTC ACCACCGGAG CGAACCCGCC GCCGGCCCCG
CCGTCCTACG CCGGGCCCCG CTCGGGCAAA GGGTGGCTGA CCGGAACCTA TCCCGATCCG
GTGATGAACC AGCAGAACCT GGAGGCGTTC TGCACCTGGC GCGGGGCGCC CTGTGACTTC
ACCCTGCTTT ACACCACCCG GAACAGCTGG GCGAACGTCA GCCAGCCGGT GGATCTGCTG
CGCACCTTCG CCAACTGGCC CGGCCGGTTG ATCATCTCCA TCCCCCCCTT CCCGGAGCAC
ATCGGTGCCA GCAACGCGAC CTGCGCCACC GGGGCCTACG ACGAGTACTG GAAGACCTTC
GGCCGCGTAC TGAACGCCTA CGGCCGGCAG AACTCCTACC TGCGAATCGC CTGGGAGGGC
AACGGCGACT GGTACGAATG GTCCGCCACC AACCCCAAGG ACTACGTCAA CTGCTGGCGC
CACGTCGCGG ACGCCATCAA CTCGACCGCG GAGCCGGACC CCACCCTGTG CTGGTGCCTC
AACGCTCACT ACTCGCAGAA CCCGCCGAGC CACAACCCGA TGGACATGTA TCCGGGCGAC
GCCTGGGTGG ACGGGGTCGG CCTCGACGCG TACGACCACT GGCCGCCGTC CCGGACAAAG
GCCGAGTTCG ACGCCCAGGC CAACGCCCCC GGTGGGCTGA ACTACTGGTT CAACTTCGCC
CGCGCGCACA ACAAGCTCTT CGGCGTCGGC GAATGGGGGG TGGTGAGTAC CAGCGGCAAC
AACGGTGGCG GTGACAACGC CAATTACATC CAGTGGATGT ACGACTGGTT CGTGGCCCAT
GCCGGCAAGG GGCTGGCCTA CGAGTATTAC TTCAACAACT GCGACCCGAA CAACGTCGGT
TCGAACCTGT ACCGACCGCT CAGCGCCACG TGTCTCTACC TGAACCGGCA GGCGGGCGCC
CGCTATAAAC AGCTCTACTC CGGGCAGTAG
 
Protein sequence
MRTENKRDGT ASQPRRYSNG RHARTRARRH RIGGLLLTLT VALMTGALTG PARAQAPTII 
EDSAIGTGLN QVSYTGTWTR CTGCIAGPLD DGFRYSSTYG SVASFQFIGT RVTIFGVKGP
WSGQAAISID GSTPQLVDTF ATVAVASSIF TSNTLTAGTH TVQITNLHQR NAASRGYDVA
FDRAEAINEV APPPPTPPVP TALTIEDTTI GTGTNQVSYS TGWTKCTGCI TSPNNSFYYS
ATAGAVATIR FSGTQINIYG AKGPIGGFST IRLDGGQPVN VDTYAPTSSI TLFYSSDKLA
AGTHTLTLTN TGQRNTSSQG NNVGFDRAEV TTGANPPPAP PSYAGPRSGK GWLTGTYPDP
VMNQQNLEAF CTWRGAPCDF TLLYTTRNSW ANVSQPVDLL RTFANWPGRL IISIPPFPEH
IGASNATCAT GAYDEYWKTF GRVLNAYGRQ NSYLRIAWEG NGDWYEWSAT NPKDYVNCWR
HVADAINSTA EPDPTLCWCL NAHYSQNPPS HNPMDMYPGD AWVDGVGLDA YDHWPPSRTK
AEFDAQANAP GGLNYWFNFA RAHNKLFGVG EWGVVSTSGN NGGGDNANYI QWMYDWFVAH
AGKGLAYEYY FNNCDPNNVG SNLYRPLSAT CLYLNRQAGA RYKQLYSGQ