Gene Franean1_4702 details


Gene Information

Locus tag: Franean1_4702
Symbol:
ID: 5673044
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5617369
End bp: 5618493
Gene Length: 1125 bp
Protein Length: 374 aa
Translation table: 11
GC content: 67%
IMG OID: 641243559
Product: amidohydrolase 2
Protein accession: YP_001508975
Protein GI: 158316467
COG category: [R] General function prediction only
COG ID: [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold
TIGRFAM ID:
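
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based, inclusive positions and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative only and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 5617369
end_bp = 5618493

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints "1125 374"
```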


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00593918
Plasmid hitchhiking: No
Plasmid clonability: decreased coverage
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATTACA AGGTCATCTC GGCGGACAAC CACATCATCG AGGCGCCGCA CACGTTCACC 
ACCTATCTGC CGAAGGAGTA CCGGGACCGG GCGCCGCGCA TCCTGCGCGG CGCGGACGGC
GGCGACGGCT GGAGCTTCGA CGGCAAGCCG CCGAGCAAGA CGTTCGGGCT GAACGCCGTA
GCCGGCCGCC CGTTCGAGGA CTACAAGGCC AGCGGCCTGA CCGTCGATGA GATCCTCCCG
GGCAACTACG ACGGCGCCGC CCACCTGAAG GACATGGACG CCGACGGCGT GGACGCCGCC
ACCATCTACC CGATGGCCTC TCTCACGTCG TACACGCTCG ACGATCGACC CTTCGCCCTC
GCCATCCTGC GGGCCTACAA CGACTGGCTG CTCGACGAGT TCTGCGCCGT CAACCCGCAG
CGGCTCATCG GTCTGCCGCT CCTGCCGGTC GACGACGGCA TGGACGTCCT GCTCGCCGAA
CTCGAGCGGG TGGCTGCCAA GGGTGCCAAG GGCGCCTTCC TCCCCTACTG GAGCGAGCGC
CCGTACTACG ACAGCTACTA CGAGCCGCTC TGGACGGCGG CCGAGCAGGC ACCACTGACG
CTGTGCATCC ACCGAACCAT GGGCGGGAAG GAACCGGCGG GGCAGGCCAC CCCAAGGCCG
GAGGCCGCCG CGGGTGTCAA CCTCGCCGGT ATCGTCCAGC GGTTCTTCAC CGGCGTCGCG
CCGTTCTCCC AGCTGACCTT CACCGGTGTG TTCGAACGGC ACCCCGGCCT GAAGTTCGTC
GACGCCGAGG TCAACTTCGG GTGGCTGCGG TTCTGGGCCC TGATGATGGA CCAGGAGTTC
GAGCGCCAGA AGCACTGGGC CAACCCGCCG CTGCACACCC CGCCCCACGA GTTCATCGGC
AAGAACCTTT TCGTCAGCGT GCTCGACGAC TTCGTCGGCT TCGAAGACGC CAAGCGCGAC
CCGCTCGTGG CGTCGGCCGC CATGTTCTCC ATCGACTACC CGCACAGCGG GACGCTGTTC
CCGAAGACCC AGCAGTACAT CGCCGAGCTG ACCCCAGGCC TCGACGACGA CCGCAAGCAC
GCCATCCTCG CGGGGAACGC TGTGCGGGTG TTCAACCTCG CATGA
 
Protein sequence
MDYKVISADN HIIEAPHTFT TYLPKEYRDR APRILRGADG GDGWSFDGKP PSKTFGLNAV 
AGRPFEDYKA SGLTVDEILP GNYDGAAHLK DMDADGVDAA TIYPMASLTS YTLDDRPFAL
AILRAYNDWL LDEFCAVNPQ RLIGLPLLPV DDGMDVLLAE LERVAAKGAK GAFLPYWSER
PYYDSYYEPL WTAAEQAPLT LCIHRTMGGK EPAGQATPRP EAAAGVNLAG IVQRFFTGVA
PFSQLTFTGV FERHPGLKFV DAEVNFGWLR FWALMMDQEF ERQKHWANPP LHTPPHEFIG
KNLFVSVLDD FVGFEDAKRD PLVASAAMFS IDYPHSGTLF PKTQQYIAEL TPGLDDDRKH
AILAGNAVRV FNLA
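
One way to check that the protein sequence corresponds to the gene sequence is to translate the nucleotides codon by codon and to compute the GC fraction directly. The following is a minimal sketch, not the method used to produce this record. It assumes the gene sequence is given 5' to 3' on the coding strand, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The function names `clean`, `translate`, and `gc_fraction` are hypothetical helpers introduced for illustration.

```python
# Codon table in the conventional TCAG ordering; the amino acid
# assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(sequence: str) -> str:
    """Remove the spaces and line breaks used in the listing above."""
    return "".join(sequence.split()).upper()


def translate(sequence: str) -> str:
    """Translate in frame from the first base, stopping at the first stop codon."""
    seq = clean(sequence)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    seq = clean(sequence)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three 10-base blocks of the gene sequence listed above.
print(translate("ATGGATTACA AGGTCATCTC GGCGGACAAC"))  # prints "MDYKVISADN"
```

Passing the full gene sequence to `translate` and `gc_fraction` allows a comparison with the listed protein sequence and the GC content field.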