Gene Franean1_1979 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1979 
Symbol 
ID5670380 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2378500 
End bp2379699 
Gene Length1200 bp 
Protein Length399 aa 
Translation table11 
GC content71% 
IMG OID641240900 
Productglycosyl hydrolase 53 protein 
Protein accessionYP_001506322 
Protein GI158313814 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.861107 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCAGGG TCGGCGTGAC CGATTACGAC GCCGGGGAAC GGCGACGCAC CGCGAACAGC 
GCGGCAACCG CGTTCGTGTT CGGGGCGCGG ACGCATCGGG CGGGTTCGAA GGCGATCATG
CGTCGCACGC CGAGCATGCT AAGGCTGCTG TCGATGGCGT TGCTCGCGAT GGTGGCGCTC
GCGGGCTGCG TTACCCCCAT CGGGGGTGGA GGCGGCCCCG CTCCGACGAC CAGTGCGGCC
CCCAGCCCCG CCGGCCCGAC GGGCTCGCCG TCCGCCGGCC CCACCCAGGC TCCGGACCCG
AGCTCGGTTC CGAACCCCGG CGTCACCCCG GTGCCGACCC TCACCTCGGC GCCGGTGACC
GCTGCCCCGG TGACCACTGC CCCGGTGACC ACTGCGCCGG GGGCCACGCC GAGCGCCGCG
CCAGTGCCCA CGACACCCGG GTCGACTGAT CCGCCTATCG GGACGGTTGC GAAGGGCGCG
AGCACCTGGT ACTTCGACAA GATCGCTCCG TCGATGACGG AGGCGGGAGT CTCGTGGTTC
TACACCTGGG GAGCCGCGCC GGAGCGGATC GCGGCGCCGG CGGGAGTCGA GTTTGTCCCG
ATGATCTGGG GGCCGGGCTC GGTCACTCCG CAGACCCTCG CGACGGTGAA GGCCAACGGC
CGCACCCTGC TCGGCTTCAA CGAGCCTGAC CTCCGCGGCC AGGCGGACAT GCCGGTTCAG
ACCGCTCTCG ATCTGTGGCC GCAGCTGGAA GCCACCGGAA TGCGTCTGGG TAGCCCGGCG
CCCGCCGCCG GCGCCGCGGA CCCGAACAGC TGGTTCGGCC AGTTCATGGC CGGCGCGGCC
CAGCGGGGCT ACAAGGTCGA CTTCATCGCG CTGCACTGGT ACGGCAGCGA CTTCGACCCC
ACCCGGGCGA CCGGTCAGCT CCGTGCCTAC ATCCAGGACG TGTACGACCG ATACCACCTG
CCGATCTGGC TGACCGAGTA CAGCCTGATG AACTTCTCAA CCTCACCCGC GACGGTCCCG
AGCGCTGAGG GTCAGGCGGC GTTCGTGACT GCCTCCACCG CGATGCTCGA GAGCCTTCCG
TTCGTTGAGC GCTACGCCTG GTTCGCGTTT CCCGCCAATC CCGACAGTCG GACCGGCCTG
TATGACGAGT CAGGCCAGCC GACCCCGGCT GGTGTCGCCT ACCAGGCGGC CGGCCGCTGA
 
Protein sequence
MRRVGVTDYD AGERRRTANS AATAFVFGAR THRAGSKAIM RRTPSMLRLL SMALLAMVAL 
AGCVTPIGGG GGPAPTTSAA PSPAGPTGSP SAGPTQAPDP SSVPNPGVTP VPTLTSAPVT
AAPVTTAPVT TAPGATPSAA PVPTTPGSTD PPIGTVAKGA STWYFDKIAP SMTEAGVSWF
YTWGAAPERI AAPAGVEFVP MIWGPGSVTP QTLATVKANG RTLLGFNEPD LRGQADMPVQ
TALDLWPQLE ATGMRLGSPA PAAGAADPNS WFGQFMAGAA QRGYKVDFIA LHWYGSDFDP
TRATGQLRAY IQDVYDRYHL PIWLTEYSLM NFSTSPATVP SAEGQAAFVT ASTAMLESLP
FVERYAWFAF PANPDSRTGL YDESGQPTPA GVAYQAAGR