Gene Franean1_7335 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_7335 
Symbol 
ID5675636 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8972137 
End bp8974401 
Gene Length2265 bp 
Protein Length754 aa 
Translation table11 
GC content69% 
IMG OID641246172 
Productputative alpha-1,2-mannosidase 
Protein accessionYP_001511560 
Protein GI158319052 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3537] Putative alpha-1,2-mannosidase 
TIGRFAM ID[TIGR01180] alpha-1,2-mannosidase, putative 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.795016 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGGCCGG GTGCACACGG CAGAACTACA CGGGCATCAG ACCTCGCGAC TCAGACCGCG 
GACGCTGTCT CCGCAGGCGA CGACCTGGCC AAGTTTGTCG ACCCGTTTAT CGGCACTGCG
CAGCAGCGTC ATGATGTCAT ACCCGGCGAT GGGGTCGCGG GTGCTTTTCC AGGAGCCGCG
GCGCCGTTCG GGATGGTCCA GTGGAGCCCG GACACCAACG GACGGGCGGC TGCGGGGTCG
GGCGGGTACC GGTACACCGA CTCGGTAATA GACGGATTCA GCCTGACCCA CTTACATGGC
ACGTCGTGTC CTGCGGCCCA GGATATTTCG ATCATGCCGT CTACCGGCTC CTATCCCTTC
TATCCACCCA CCGGCGCCGT GGACGAGCCA TCGTTGTGGG GCGCCGGATT CTCTCATGGC
GACGAGTCGG CCACACCCGG TCGCTACAGC GTTCAGGTCA CCTCGCCGAC GGGTGATCGG
ATAGGTGCGG ATCTCGCCGC CGGAGCCCGT GCGGCGATTG GTCGCTTCTC GTTTCCGGCA
GGCGTGGCGG GATCCCTGGT CCTCGACGTG GGACGCCGTG CCGACGGCGC CGGCCGGGCG
GACCTGCATG TGGACTCCGC CACCGGGGAG GTTTCAGGCT CGGTGACCAG CCGGGTCTTC
TGCCGGCAAC CAGCCGTCAG CACGCTTTAC TTCGTCGCGA GACCTGATCG GCCGCCGGCG
GCTGTGGGCA CCTGGTCGGC GGGGCGATGG TCACTGAGCC CGGACGCGGC CGACGAGGGA
GGACCCACCG CGACGTCGCG TTCCGCATCG GTCGGAGCCT GGCTGAGAAT GCCACCGGCA
GCAGACGGAG CGGTCATGCT CCGTGCTGCT GTGTCGTATA CCAGCGTCGA GGGGGCGCGC
GCCAATCTCG AGGCCGAGTA CCCCAGCAGT GTCGAGGCGG TCGCGAGCGC CACCCGTGCC
GAATGGAACC GGAGGCTGTC GACCGTGTGT GTCGCCTCCG CGCCACGGTC CGAGTTACGC
ACCCTGTACA CAGCGCTCTA TCACGCGATG CTGGGCCCCG GTGCCGTCAG CGATGCCGAC
GGCACCTACC GTGGCTGGGA CGGATCCACC GCCGCCGCCC AGGGATGGAC CGCACGCAGC
GGGATCGCGG GCTGGGACGC CTACCGCAGC TCGTTCCCAT TACTGGGGAT GCTGGCTCCG
GACGTCGCTA CCGACACCGC ATGGTCGTTG ATAGCCGGCA CCATGCAGAC CGGGTCGGTG
CCAGGATGGA CTCTCGCGAC CAGTCGGCCC GACGTCATGC CTGGCGACGG CGGCACGGTC
CTGCTTGCCG AGGCGGTGGC TCTCGGGTTG CCCCTTGACC CAGCACAGGC TTTGGCCGCC
GCGCTGGCCA GCGCGGCCCG CCGCCCGTTC GCGGCCGAGT ACGACCGACT CGGCTTCGTC
CCGGTCGGCC CCACCGGCCC CGCGGATGCT GTGTCCATCA CTCTCGAGTA CGCCCTCGCG
GACTTCGCGA CAGCGCAACT GGCGCTTGCC GCGAACCGGC CCGAAATCGC GGCTACCTTC
ACTGAACGGT CGACGCGCTG GCGGTCGGTA TGGGATCCGG TCAGTAAGCT GCCCGCCCCG
CGCAGCACGG ACGGTGGCTT TCTACCAGTG GAGCCGACGA CCCTCCCCGG CTACACCGAA
GGCACCGCGG CCCAGTACTC ATGGCTGGTT CCGCAGGATC CCGATGGCCT CGCCGAGGCG
ATGGGCGGAC GGCGCGTCGC GGTTGACCGG CTCGCCGACT TCCTCGCCCG GCTCGACGAC
GGCCCGATCG GGGCACATGC GGAGATGGGA AACCAGCCCA GCCTCCTCGT ACCCTGGCTC
GGAGGATCGT TCGGCGCCGT CGATCTGACG ATCGACGCTC TCGACCGGAC GCGCCGGACT
TTGTTCCGGG ATACTCCCGA CGGTCTGCCC GGGAATGACG ACACTGGGGG CCTGTCCGCC
TGGTACGTGC TCGCCTGGCT TGGACTGGGC CCCGTCCAAC CGGGAACAGA CCTGCTCATC
GTCACTCCGC CGGTCGCCTC ATCGGTCGAG ATCACCCTTC CTCACGGACA GCTTCGGTTG
GTGAACCAGC AGGTCGGCGG GGGGGATGGT CGCATACCGG TCGGGCTGAG ACGTGACGGC
GAGTCGCTCC CCGGACCCTG GATCCGGTTT CGCGATCTCA GCGCAGGAGC GACTCTGACC
TGGTCACTGT CCGATCAGCC GACCGCCCGG GGTACGGAGA ACTGA
 
Protein sequence
MRPGAHGRTT RASDLATQTA DAVSAGDDLA KFVDPFIGTA QQRHDVIPGD GVAGAFPGAA 
APFGMVQWSP DTNGRAAAGS GGYRYTDSVI DGFSLTHLHG TSCPAAQDIS IMPSTGSYPF
YPPTGAVDEP SLWGAGFSHG DESATPGRYS VQVTSPTGDR IGADLAAGAR AAIGRFSFPA
GVAGSLVLDV GRRADGAGRA DLHVDSATGE VSGSVTSRVF CRQPAVSTLY FVARPDRPPA
AVGTWSAGRW SLSPDAADEG GPTATSRSAS VGAWLRMPPA ADGAVMLRAA VSYTSVEGAR
ANLEAEYPSS VEAVASATRA EWNRRLSTVC VASAPRSELR TLYTALYHAM LGPGAVSDAD
GTYRGWDGST AAAQGWTARS GIAGWDAYRS SFPLLGMLAP DVATDTAWSL IAGTMQTGSV
PGWTLATSRP DVMPGDGGTV LLAEAVALGL PLDPAQALAA ALASAARRPF AAEYDRLGFV
PVGPTGPADA VSITLEYALA DFATAQLALA ANRPEIAATF TERSTRWRSV WDPVSKLPAP
RSTDGGFLPV EPTTLPGYTE GTAAQYSWLV PQDPDGLAEA MGGRRVAVDR LADFLARLDD
GPIGAHAEMG NQPSLLVPWL GGSFGAVDLT IDALDRTRRT LFRDTPDGLP GNDDTGGLSA
WYVLAWLGLG PVQPGTDLLI VTPPVASSVE ITLPHGQLRL VNQQVGGGDG RIPVGLRRDG
ESLPGPWIRF RDLSAGATLT WSLSDQPTAR GTEN