Gene Franean1_6846 details

Gene Information

Locus tag: Franean1_6846
Symbol:
ID: 5675159
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8345936
End bp: 8347141
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 66%
IMG OID: 641245695
Product: amidohydrolase 2
Protein accession: YP_001511086
Protein GI: 158318578
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
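
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are inclusive (an assumption, though it is consistent with the reported numbers) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields above.
start_bp = 8345936
end_bp = 8347141
reported_gene_length = 1206
reported_protein_length = 401

# Assumption: coordinates are inclusive, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# Three bases per codon; the last codon is the stop codon and is not translated.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # 1206 0 401
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```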


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.206422
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGTTCA GCTGGTTCAT TTCGGTCGAC GACCACCTCA TCGAGCCGGC GCGGCTGTGG 
CAGGAGCGCC TGCCACAGCG CTGGCGCGAC ACCGGGCCCC GCATCGTGCG GGACGGGAAG
TCGGAGTTCT GGGTCTACGA GGACCGCCAG ATCGTCACCA CCGGCCTGAA CGCCGTGGCG
GGCAAGACCC GCGAGGAGTT CTCGCCGGAG CCGATCTCGT ACGACGACAT GCGCCCCGGC
TGCTACGAGC CCGCGGCCCG GGTGGCCGAC ATGAACCAGG GCAACGTGCT GTCGTCGATC
CTGTTCCCGT CGTTCCCGCG GTACTGCGGC CAGGTCTTCC ACGAGGCCAA GGACAAGGAG
CTCGGGCTGC TCTGCGTCCA GGCGTGGAAC GACTTCATCC TGGAGGAGTT CGGCGCGGCC
TACCCCGGCC GCTTCATCCC CATGATGATC ATTCCGTTGT GGGACCCGGT GGCAGCGGCG
GCCGAGATCC GGCGGACGGC GGCCCGCGGC GGCCGGTCGA TCGCCTTCTC GGAGAACCCG
ACCAAGCTCG GTCTCCCGTC GATCCACACC GACTTCTGGG AGCCGATGTT CGAGGCCTGC
AACGAGACCG GCTACGTGAT CTCGATGCAC GTCGGGTCGT CGTCCAACCT GATCCGCACC
TCGCCGGACA TGCCGACGCT GGCCTTCATG GCCTACTCGG CGGCGGCGAA CCAGGCCGGC
ACGTTGCTGG ACTGGCTGTT CAGTGGCATT TTCGACCGGT TCCCGAACCT CAAGATCGCT
CTTTCCGAGG GCTCGATCGG CTGGATTCCG TACTTCCTGG AGCGGGCTGA GCAGGTCATC
GACAAGCAGC GGTTCTGGGC GTCGCGGTTC GATATCGACA TGAACGCCTC CCACGAGCGC
GGTGAGGCCA AGGGCGAGGC GAAGTTCAAC CTCGACACCA ACATTCGCCA GCTCTTCGCC
GACCACGTTT TCGGCACCTT CATCGAGGAC CAGGCCGGCG TCCGCCTGCT CGACATCATC
GGTGAGGACA ATGTGATGCT CGAGTGCGAC TACCCGCACT CGGACTCCAC CTGGCCGGAC
ACCGTGAAGC TGGCCGGCGG CTGGCTCGGG CACCTTTCCG ACGAGGTCCA GCACAAGATC
ACGATCGGGA ACGCGGCCCG CGTCTACAAC TTCACGCCTG CTGACCCGGC GACCATCACG
CTGTGA
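
The GC content reported for this gene can be recomputed from the sequence itself. The following is a minimal sketch: `gc_fraction` is a hypothetical helper name, and it assumes the sequence is pasted as shown here, in space-separated blocks of ten bases, so whitespace is stripped before counting. Only the first line of the gene sequence is used as sample input; paste the full sequence to compare against the reported value.

```python
def gc_fraction(sequence: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace and case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = bases.count("G") + bases.count("C")
    return gc_count / len(bases)


# First line of the gene sequence above, used only as sample input.
first_line = "GTGTCGTTCA GCTGGTTCAT TTCGGTCGAC GACCACCTCA TCGAGCCGGC GCGGCTGTGG"
print(f"GC fraction of the first 60 bases: {gc_fraction(first_line):.2f}")
```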
 
Protein sequence
MSFSWFISVD DHLIEPARLW QERLPQRWRD TGPRIVRDGK SEFWVYEDRQ IVTTGLNAVA 
GKTREEFSPE PISYDDMRPG CYEPAARVAD MNQGNVLSSI LFPSFPRYCG QVFHEAKDKE
LGLLCVQAWN DFILEEFGAA YPGRFIPMMI IPLWDPVAAA AEIRRTAARG GRSIAFSENP
TKLGLPSIHT DFWEPMFEAC NETGYVISMH VGSSSNLIRT SPDMPTLAFM AYSAAANQAG
TLLDWLFSGI FDRFPNLKIA LSEGSIGWIP YFLERAEQVI DKQRFWASRF DIDMNASHER
GEAKGEAKFN LDTNIRQLFA DHVFGTFIED QAGVRLLDII GEDNVMLECD YPHSDSTWPD
TVKLAGGWLG HLSDEVQHKI TIGNAARVYN FTPADPATIT L
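
The protein sequence starts with M even though the gene sequence starts with GTG, which normally encodes valine. Under translation table 11 (the bacterial code named in the record), GTG can act as a start codon and is then read as methionine. The sketch below illustrates this on the first 30 bases of the gene. It is a simplified illustration, not the pipeline that produced this record: `translate` is a hypothetical helper, and the set of alternative start codons is limited to ATG, GTG and TTG as an assumption (table 11 allows a few more).

```python
from itertools import product

# Codon table in TCAG order. Table 11 uses the same amino-acid
# assignments as the standard code and differs in its start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # an initiator codon is read as methionine
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("GTGTCGTTCA GCTGGTTCAT TTCGGTCGAC"))  # MSFSWFISVD
```

The output matches the first ten residues of the protein sequence listed above.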