Gene Franean1_5674 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_5674 
Symbol 
ID5674001 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp6887281 
End bp6888702 
Gene Length1422 bp 
Protein Length473 aa 
Translation table11 
GC content70% 
IMG OID641244528 
Productamidohydrolase 
Protein accessionYP_001509931 
Protein GI158317423 
COG category[F] Nucleotide transport and metabolism
[R] General function prediction only 
COG ID[COG0402] Cytosine deaminase and related metal-dependent hydrolases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGGCG CGGAGCTTGT CGCCACGGTC GATGCCGACC GCCGAGAGAT CCCAGGCGGC 
TGGGTAGCAA TCACCGACGG CCTCGTCAGC TCCCTCGGCG GCCCGGCGGA GACACCCCCG
ACCGCGACCC GGACCCTGCG TGCCGACGGC TGTCTCATAA CGCCGGGCCT GGTGAACACA
CATCATCACA TCTACCAGAA CCTCACCCGT TCCTTCGCTC CGGCACTCGG CGGCACCCTT
TTCACCTGGC TGACCACGCT CTATCCACTC TGGTCACGGC TGGACGAGGA GGCCGTCCAC
ACCTCGGCCT ACGTGGGCCT GACCGAACTG GCGCTCGGCG GCTGCACGAC ATCAACAGAC
CACCTCTACG TGCACCCGCG CGGTGGCGGC GATCTCATCT CCGCCGAGAT CGCGGCAGCC
CGGACCCTGG GCATGCGGTT CCACCCGACC CGCGGCTCGA TGTCGCTTTC GGTCAAGGAC
GGCGGGCTCC CTCCTGACTC TGTCGTCCAG GACGCGGATG AGATCCTCGC CGACTCCGCC
CGGCTTGTGG CCCAGCATCA CGACCCGTCC CACGGCGCGA TGGTGCGGAT CGCCCTGGCC
CCCTGTTCGC CGTTCTCCGT CAGTCCGGAA CTCATGCGGG CCACCGCGGA ACTGGCCGAG
TCCTTGGACG TCCGGCTACA CACGCATCTC GCCGAGGACC CCGAGGAGGA CGACTACTGC
CTGGCGGTGT TCGGACGGCG TCCGATCGAC CAGTTCGCGG AGGTTGGCTG GGGCGGCGAC
CGGGCGTGGG TCGCGCACTG CATCTGCCCC AATGACGAGG AGGTCGAGCA GCTCGGCAGG
TGGGGCACAG GGGTGGCCCA CTGCCCGAGC AGCAACATGA TTCTCGGCGG CGGCCTCGCG
CCCGTGGCCG AGCTCCGCTC GGCCGGCGCC CCGGTCGGCC TGGGCTGCGA CGGGTCGTCA
TCCGCCGACT CCGCCTCGCT GTGGTTGGAG GCTCGTACCG CCATGCTGTT GGGCCGGCTG
CGACACGGCG CCGCGGCGAT GTCCGCCCGG GACGCGCTGG AGATCGCCAC CCGGGGCGGC
GCAGGCTGTC TCGGCAGGAC CGGTGAGATC GGTGAGCTCT CCGTCGGGTC TGTCGGCGAC
CTCGTCGTCT GGCCGTTGGA CGGGGTCGCG TACGCGGGAG CGCTCTCCGA TCCGATCGAC
GCCTGGCTGC GTTGCGGGCC CACAGCGGCC CGGCACACGA TCGTGGCCGG CAGGCTGGTG
GTGGAGAACG GAGTGCCGGT CCATCCTGAT CTCGACGAGA TGCTCGTCCG GCACCGCCGG
ACCGCCGGCG GCATCCAGGC GGCGTTCGAC GATGCGGGCA TCGATCCGAC CGTTCCCATC
AATACCGGCG GCAGCAGCGT CGGGGCGGCA AAATCACTTT GA
 
Protein sequence
MIGAELVATV DADRREIPGG WVAITDGLVS SLGGPAETPP TATRTLRADG CLITPGLVNT 
HHHIYQNLTR SFAPALGGTL FTWLTTLYPL WSRLDEEAVH TSAYVGLTEL ALGGCTTSTD
HLYVHPRGGG DLISAEIAAA RTLGMRFHPT RGSMSLSVKD GGLPPDSVVQ DADEILADSA
RLVAQHHDPS HGAMVRIALA PCSPFSVSPE LMRATAELAE SLDVRLHTHL AEDPEEDDYC
LAVFGRRPID QFAEVGWGGD RAWVAHCICP NDEEVEQLGR WGTGVAHCPS SNMILGGGLA
PVAELRSAGA PVGLGCDGSS SADSASLWLE ARTAMLLGRL RHGAAAMSAR DALEIATRGG
AGCLGRTGEI GELSVGSVGD LVVWPLDGVA YAGALSDPID AWLRCGPTAA RHTIVAGRLV
VENGVPVHPD LDEMLVRHRR TAGGIQAAFD DAGIDPTVPI NTGGSSVGAA KSL