Gene Franean1_2626 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2626 
Symbol 
ID5671020 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3107022 
End bp3108581 
Gene Length1560 bp 
Protein Length519 aa 
Translation table11 
GC content71% 
IMG OID641241542 
Producthypothetical protein 
Protein accessionYP_001506962 
Protein GI158314454 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones24 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGTAGGT CCAGTCTGCG GGCGGAGCAG GACGAGTTGC GGGCGCGGAT GCGTGCGGTC 
GGTATGTCCC ACGACGAGAT CGCGATCGAG TTCGCCCGCC GCTACCACTA CCGTCCCCGC
GCCGCCCACC GCGTCGCGCA CGGCTGGACC CAGCAACAGG CCGCAAACCA CATCAACGCC
CACGCCGCCC GCACCGGCCT CGACCCCCGC GGCACCGCCC CCATGACTGC CCCCCGGCTG
TCGGAGCTGG AGAACTGGCC GCTACCGAAC AACCGCCGCC GGCCCACCCC CCAGCTCCTC
GCCCAACTCG CCGAGGTCTA CGACACCAGC ATCCACAACC TCATCGACCT CGACGACCGC
GAACACCTCA CCCCCGCCGA CACACTCCTG ATCAACACCA CACGCCGAGA CGCTCGATCG
ACGCCGCCGG CAGGGTCCCC AGTGGCACTG TCACCACCCG CGGTCCCCCG CTCCAGACCT
CCAGCCGGTC CGGGCTTCGA GTCGGAATAC GGCGGCGCGC CCAAGAGACT GCGTCTCGAT
GCGTCTGCTG GGAACATCGA AGGGGTGGAC GCTCTCGGCC GCCGCGGGTT CACCCTCCTC
GCCGGATCCG CACTCATGGC GGGCCTGGCG GGTAACGGCC GTGCCCGCCG CGTCGACCCG
GCGCTCGTCT CCTATTTCGA CGGCCAGCTG AAAGGCCACT ACCACGCGGA CATGCTGCTC
GGCTCCGGCG CGCTGATCGG CACAGTCGCC TCCCAATTCG AGGTCATCGC GCAGCTGGTG
GACACAGCGG ACGGGTCGAC CCGCCAGCGC ATGGCGAAGG TCGGCTCGTC GTTCGCAGCG
TTCGCGGCCT GGCTGTGGCT GGACGCCGGC GATCCGGTCG CCGCGATGCG CTGTCACGAC
GCCGCCTTGG AGCTGGCGCA CCGCTGCGGG GAACGCGACG CCGTCGCCTG CGCGCTGGTC
GACCGGGCGA TGGCCTTCAC CGACCTGGAA AACGCAGCGG CCGTGATCGA CCTGTGCCAG
GCCGCGCTGG TCGATGCCCA GCACCTCTCG CCCGAGGTTC AGGTGTTCGC CTTGCAGCAG
CAGGCGCACG GTGCCTCGCT GCGCGGTGAC CATCGCCAGG TCGATCTCCT GCTCGATCAG
GCCGGCCGAC TCGTGGACCA GGTCGACGTC GAGGAGTGGG GCACGGCCTG TCGCCGTACC
AACGGCTACG TCGAGGTGCA GCGTGCCACC TGCTACGGAC GGCTCGGACT GGCTGACGAT
GCCGACCGTC TCTGGCAGCA GATCATCCCC GCCGCACATC CCTCAGCCCG CCGCGACGTC
GGGGTCTGGT CGGCACGCCA TGCCGTCGCC GCCGCACAGC AACATGAGCC GGAACGGGCG
GTGGAACTCG CGCGCCACGC GACCGCGCTC GCGATGGAGA CCGGCTCCGC GCGGGCCCGG
CGAGAACTGG CCGCGGTCGC GGCGGCCATG GCCCCGTGGC GCACTCACCC CGTCGGCCAG
GATCTGGCGG AGGTGCTCGC GCCCGTTACC ACCGACGAGA CCGGGATGGA TCATGGCTGA
 
Protein sequence
MSRSSLRAEQ DELRARMRAV GMSHDEIAIE FARRYHYRPR AAHRVAHGWT QQQAANHINA 
HAARTGLDPR GTAPMTAPRL SELENWPLPN NRRRPTPQLL AQLAEVYDTS IHNLIDLDDR
EHLTPADTLL INTTRRDARS TPPAGSPVAL SPPAVPRSRP PAGPGFESEY GGAPKRLRLD
ASAGNIEGVD ALGRRGFTLL AGSALMAGLA GNGRARRVDP ALVSYFDGQL KGHYHADMLL
GSGALIGTVA SQFEVIAQLV DTADGSTRQR MAKVGSSFAA FAAWLWLDAG DPVAAMRCHD
AALELAHRCG ERDAVACALV DRAMAFTDLE NAAAVIDLCQ AALVDAQHLS PEVQVFALQQ
QAHGASLRGD HRQVDLLLDQ AGRLVDQVDV EEWGTACRRT NGYVEVQRAT CYGRLGLADD
ADRLWQQIIP AAHPSARRDV GVWSARHAVA AAQQHEPERA VELARHATAL AMETGSARAR
RELAAVAAAM APWRTHPVGQ DLAEVLAPVT TDETGMDHG