Gene Franean1_3226 details


Gene Information

Locus tag: Franean1_3226
Symbol:
ID: 5671601
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3810580
End bp: 3813069
Gene Length: 2490 bp
Protein Length: 829 aa
Translation table: 11
GC content: 62%
IMG OID: 641242119
Product: glycoside hydrolase family protein
Protein accession: YP_001507539
Protein GI: 158315031
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3250] Beta-galactosidase/beta-glucuronidase
TIGRFAM ID:
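
The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are inclusive and that the protein length excludes the stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 3810580
end_bp = 3813069
gene_length_bp = 2490
protein_length_aa = 829

# Assumption: coordinates are inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# A CDS of N bases holds N / 3 codons; the final codon is the stop codon
# and does not contribute an amino acid.
codons, remainder = divmod(gene_length_bp, 3)
expected_protein_aa = codons - 1

print(span_bp == gene_length_bp)                 # coordinates agree with gene length
print(remainder == 0)                            # length is a whole number of codons
print(expected_protein_aa == protein_length_aa)  # protein length agrees
```

Under these assumptions all three checks agree with the listed values: 3813069 - 3810580 + 1 = 2490, and 2490 / 3 - 1 = 829.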


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACCCGTG TGTCGTTCAA CCGGGGCTGG ACCGTGCGTC CGCGGACCAG CCTCTTCGGC 
GAGGTGAGCG GAAATGCGCC TGCTGCTGCC GGAGTGACGT TGCCGCACGA TGCGATGATC
GTCCTTGACC GCTCCGCGGA GCACAGCGGG GGAGCCGAGA CGGGTTACTT TCCTGACGGT
GTCGTCGAAT ATGTGAAATC GTTCGAGGTG CCCGAATCGT ACCGGGACAA GCGAGTGATT
CTCGAGTTCG AAGGGGTGTA CCGGGACGCG CAGGTGTATG TCAACGGCGC CTTCGCGGGA
CAAAGGCCAT ATGGATACTC GCTGTTCCGT GTGAGTCTTG ATCCTTTCCT GCGTCATGGG
CAGGCCAACG AAGTGCGGGT CGAGGCGCGC GCTCACAGCG ATTCGCGTTG GTACAGCGGC
CTCGGCATAC ACCGCCCCGT GAGTCTCATC GTGAACGAAC CGGTGCACAT TGCGGCCGAT
GGGGTGCGAA TCACGACGCC AGACGTAGAC GACGAACGGG CGGTCGTCGC GGTCGCGACC
GACATCGTGA ACTCGACTCC GCATACCGTC ACGGTCGAGC TCGTCTCCGA GGTCATGGCT
GACGGCGAAC GGGTCGCAAG GAACGCTGCC CCCGTCACCG TCCGTCCCGG CGAAACGGCG
ACGGCAAGGC AACGGCTGTA TGTCGGTAGC CCTCGACGGT GGGGAGTCGA CCACCCCCAC
CTCTATCGGC TCCGGACGGG ACTCTGGAAT GCGAACACAG GGCTCGAGGA GGTCGCGACA
GACTTCGGAA TACGCACTAT TCAGGTAGAC CCTACGCACG GTCTACGGAT CAATGGCGCG
ACGGTGAAGC TGCGCGGTGC GTGTATCCAT CACGACAATG GTATCCTTGG CGCGGCGGCC
ATTTCGCGGG CGGAGGAGCG ACGGGTCGAG ATCCTGAAGG GAGCCGGTTT CAACGCCGTT
CGGGCGTCAC ACAACCCGCT GAGTAAAGCG ATGCTCCACG CGTGCGACAG ACTAGGGATG
CTCGTACTGG ACGAGGCCTT CGACGTTTGG ACCCACGCGA AGATGCCCTT CGACTACTCG
CTGGCTTTCC CTGAATGGTG GCAGCGAGAT ATCGAGTCGA TGGTCAAGAA GGACTTCAAC
CATCCTAGCG TGATCCTGTA CTCGATCGGT AATGAAATCC CCGACCTCGC GTCGCCGCTT
GGTTCGGCAT GGAGTCGGGA CATCTCCGAG AAGATCGGCG AGATCGACGG AACCAGGTTC
ACCACCAACG CGGTGAATGG CTTCCTCGCC GTCATGTCCG ATCTGGTGAA GCAGGTTGAT
GCACGTCCGG GTCCGGGGCA AACGGGTGGT CTCAACACGA TGATGGCCTC GATCGGCGAC
GTGCTGGACC AGATCAATGC ATCCGAAATC GTCAGTGAAC GTACGAGAGA AGCACACGCG
GTGGTCGACG TCTCCGGGCT GAACTACGGG GCCTCGAGGT ACGAGATTGA CCGCGATCTC
TTTCCTGACC GAGTCATCCT GGGAACGGAG AGCCACGGAC CTCGGATCGA CCGGATCTGG
CATCTGGTCG AGGAAAACAG TCGCGTCATC GGCGACTTCA CGTGGACTGG GTGGGACTAT
CTCGGTGAAG CTGGCATCGG TCGTGTGTCA TATCCGGGCG AGCAGTCGGC TGAGGGCGGT
CTCCTAGCCC CCTTTCCCTC GATTCTTGCC GGTAGCGGAG ATATCGACAT CACTGGCTTC
CGGCGGCCGA TGTCATACTT TCGGGAAACG GTGTTCGGCC TCAGGAGTCA GCCGTACGTC
GCTGTGCACC GTCCCCAGTA CCACGGTAGG AAACGTGTTG CCGGGCCGTG GTCCTGGAGT
GACACGGTAT CGAGCTGGGC CTGGGGCATC GCCTCTGGTT CTCCGGTCGT CGTCGAGGTT
TACAGCAACG CGGAAGAGGT GGAACTTCTC CTCAATACCC GGTCATTGGG CCGTAAGTCC
GTTGGCAGTG AGCGGGCCTT CTCGGTGTTC TTCGACTGTG CGTACGAACC CGGGCGGCTT
ACCGCGGTGG CCTACACGGC GGGCGTCGAG CAGGGCAGGA CCTCGCTCCA GTCCTCGGAG
GGGGGTACCC GGTTGGCGGT GCGCGCCGAC CGAACGACAT TACGAGCCGA CGACACGGAT
CTCGCCTTCC TGGAGATCGA ACTCGTTGAC ACCGCCGGGA CCTGCCTGGT GGATCGTGAA
CTCGAGATAG TCGTAGTCGT CGAGGGCCCG GCTGTTCTGC AGGCGTTGGG TAGCGCTCGT
CCAGACACAG AAGAGAGGTA CGACGGACAC ACACATCGGA CCTATGACGG GCGGTTGCTC
GCGGTTGTAC GGCCGGTAGC TGTTGGAGCT GTGGTCGTGA AGGCAACAGC AGAGAGCCTC
GCCAGCGAAG TCATACGGCT AGATGTCGTC CCCTCCGATT CCCCGCCGAC GAAACCGCCG
CCAGTGATTT CCGTGAATCG ACCGGCGTAG
 
Protein sequence
MTRVSFNRGW TVRPRTSLFG EVSGNAPAAA GVTLPHDAMI VLDRSAEHSG GAETGYFPDG 
VVEYVKSFEV PESYRDKRVI LEFEGVYRDA QVYVNGAFAG QRPYGYSLFR VSLDPFLRHG
QANEVRVEAR AHSDSRWYSG LGIHRPVSLI VNEPVHIAAD GVRITTPDVD DERAVVAVAT
DIVNSTPHTV TVELVSEVMA DGERVARNAA PVTVRPGETA TARQRLYVGS PRRWGVDHPH
LYRLRTGLWN ANTGLEEVAT DFGIRTIQVD PTHGLRINGA TVKLRGACIH HDNGILGAAA
ISRAEERRVE ILKGAGFNAV RASHNPLSKA MLHACDRLGM LVLDEAFDVW THAKMPFDYS
LAFPEWWQRD IESMVKKDFN HPSVILYSIG NEIPDLASPL GSAWSRDISE KIGEIDGTRF
TTNAVNGFLA VMSDLVKQVD ARPGPGQTGG LNTMMASIGD VLDQINASEI VSERTREAHA
VVDVSGLNYG ASRYEIDRDL FPDRVILGTE SHGPRIDRIW HLVEENSRVI GDFTWTGWDY
LGEAGIGRVS YPGEQSAEGG LLAPFPSILA GSGDIDITGF RRPMSYFRET VFGLRSQPYV
AVHRPQYHGR KRVAGPWSWS DTVSSWAWGI ASGSPVVVEV YSNAEEVELL LNTRSLGRKS
VGSERAFSVF FDCAYEPGRL TAVAYTAGVE QGRTSLQSSE GGTRLAVRAD RTTLRADDTD
LAFLEIELVD TAGTCLVDRE LEIVVVVEGP AVLQALGSAR PDTEERYDGH THRTYDGRLL
AVVRPVAVGA VVVKATAESL ASEVIRLDVV PSDSPPTKPP PVISVNRPA
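
The relationship between the gene sequence and the protein sequence can be illustrated by translating codons with translation table 11, as listed in the gene information. This is a rough sketch rather than the database's own method. The function name `translate_cds` is hypothetical, the amino acid string is the table 11 assignment in T, C, A, G codon order, and the first codon is assumed to be an initiation codon that reads as M (here GTG).

```python
BASES = "TCAG"
# Amino acids for translation table 11, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the 10-base display layout.
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[pos:pos + 3]]
        if aa == "*":
            break
        # Assumption: the first codon is a start codon and is read as M.
        protein.append("M" if pos == 0 else aa)
    return "".join(protein)


# First row of the gene sequence above.
first_row = "GTGACCCGTG TGTCGTTCAA CCGGGGCTGG ACCGTGCGTC CGCGGACCAG CCTCTTCGGC"
print(translate_cds(first_row))  # MTRVSFNRGWTVRPRTSLFG
```

The output matches the first 20 residues of the protein sequence above. The sketch does not handle ambiguous bases such as N.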