Gene Franean1_1708 details


Gene Information

Locus tag: Franean1_1708
Symbol:
ID: 5670110
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 2039887
End bp: 2041662
Gene Length: 1776 bp
Protein Length: 591 aa
Translation table: 11
GC content: 77%
IMG OID: 641240626
Product: hypothetical protein
Protein accession: YP_001506052
Protein GI: 158313544
COG category:
COG ID:
TIGRFAM ID:
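
The coordinate and length fields above are related by simple arithmetic, which the following minimal sketch checks. It assumes the start and end positions are 1-based and inclusive, and that the final codon of this unspliced CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_length // 3 - 1


length = gene_length_bp(2039887, 2041662)
print(length)                      # gene length in bp
print(protein_length_aa(length))   # protein length in aa
```

Under those assumptions the computed values agree with the Gene Length (1776 bp) and Protein Length (591 aa) listed in the record.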


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.864094
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0229836
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCTCCG CCTCACGATC CACATCCCTG CCGGGGATTA CCCCGTTCGC GCTGGCCGGC 
CCCGTGCTGG CCGCCGGTGG CAAGGATCAG ACGGGCCTGC ACCTGCTCCT CGCCTTCGGC
CTGTTGCTGC TGGTCGTGGC AGTCGTGGGC GCGATGCGAC GGGCCTGGCG CGGCCGTGCG
CAACAGCAGG AGGAGAACCT GCCGGACCTG CCCGAGCCGC CGGAGCAGAC CGGGAACGTG
CTGGCCGCCC CGCTGCGTGG GCGCTACCTC GGCACGGTGG ACGCCGGCCA CTGGCGGGAG
TGGATCGCCG CGCGCGGCCT GGCCGGGCAC GACGGCGACT ACATCGCCGT CTACGAGCTT
GGAGTGCGGG TCGACCGGGA CGGCGAGGCG TTCTGGATCC CCCGGGAGGC CGTGCGGGGC
GCCCGCCTCG AGCGGGCGCA CGCCGGGAAG GTCGCCGCGC CGAGCCGGCT GATCGTGGTC
GCGTGGTCGT TCGAGGGCCG GGAGCTGGAG GCCGGCTTCC GCGGCGAGGA CCGGGCCCGC
CAGCCGAAGG TCGTCCGCTC CGTGCACGAC CTCATCGGGC CGGCGCCCGC CCAGCCGATG
TCCGGCGACA TCACCTCGCC GCACGCCCTG CCCCGGCCGC GCAACCGGCT GCGGCCCCGC
GTGCCGGCGC CGGCACGCCC GGCCGAGCCC GGCGCGCCGG CCGCCGCGGC CCCCGCCCGC
GGGCAGCGTC ATGATCTCGC CGCGGTGGCC CCGGGCGGGC CCGCGACGAT GCCGATCCCG
GTCAACGGCC GCCAGCCCCG GTCGGAGCGG GCCGGGTGGC GCCGCGGTGG CGCCGCGGCC
AGTCAGGGCG CCGCCGCCGA GACCCACGCC GGCCAGCGCG GATACAGCGC GGACGCGCCC
GGCCCGGTCG CCTCCGGCGG CTACGACACC GCTGCCCACG GGACGGGTGC CCTTAGGACG
GGTGCCCAGG ACACGGGCGC CTACGACATC CGCGCCCACG TCACCGGCGC GCACAGCATC
AGTACCGACA GTACCGGCGT CCACGGCACC GGCGCCCACG GCGCCAGTGC GTACGACACC
GGTGCGTACG ACACCGGCTC CCACGGTGCT GGCGGGTACG GGACGGGCTC GCACCGGACG
GGTGCGTACG ACACCGGCGG TTACCGGCCA GGTGCCCACG ACGTGGGCGC CCAGGGCTCC
CAGGCCGTGA GCGGCGGGAC GGCCGGCTAC GACACCGCCG GCTACAGCAC AGCCGGCTAT
GACACGGCCG GGTACGGGAC TGACAGCTAC GACCTGCGTC GCCAGGGCAC CGGCGGGCTG
GACGCGCGCG GCCACGCCAC CGGCGGGTAC GGGCGCCCCG GCGCCCCCGG GCCGGCCGCT
CCCGACCAGG GGGGGCTCGA TTCGGCCGCG TACCACCTGG GTGCCGGCGA CACCGGCGTC
CACGGCTCGG GTGCCTACGG CTCGGGTGCC TACGGCTCGG GTGCCTACGG CTCCGGTGCC
AACGACACGG GTGCCCACGA CTCGCGTGGA TACGGCCAGG GCGCATACGG CCAGGGCCGG
CGCGACCCGG GCGGACGTGA CCAGGGTGGA TACGACCGGG AGGCGTACGC CTCCCGCGGC
CGGGCCGGCA CGCCGCCGGC CGTCGGCGCG CCGGGTGACC AAGGGAACTA CTGGCGGACC
GGCGCCGACC CGGCGGAGCG GCCGCGAGAC CAGGGGACCG ACGCGTTCAC CGCACCGCCC
GGTGAGGCCT CGTACCGACG GGAGGAGTAC CCGTGA
 
Protein sequence
MTSASRSTSL PGITPFALAG PVLAAGGKDQ TGLHLLLAFG LLLLVVAVVG AMRRAWRGRA 
QQQEENLPDL PEPPEQTGNV LAAPLRGRYL GTVDAGHWRE WIAARGLAGH DGDYIAVYEL
GVRVDRDGEA FWIPREAVRG ARLERAHAGK VAAPSRLIVV AWSFEGRELE AGFRGEDRAR
QPKVVRSVHD LIGPAPAQPM SGDITSPHAL PRPRNRLRPR VPAPARPAEP GAPAAAAPAR
GQRHDLAAVA PGGPATMPIP VNGRQPRSER AGWRRGGAAA SQGAAAETHA GQRGYSADAP
GPVASGGYDT AAHGTGALRT GAQDTGAYDI RAHVTGAHSI STDSTGVHGT GAHGASAYDT
GAYDTGSHGA GGYGTGSHRT GAYDTGGYRP GAHDVGAQGS QAVSGGTAGY DTAGYSTAGY
DTAGYGTDSY DLRRQGTGGL DARGHATGGY GRPGAPGPAA PDQGGLDSAA YHLGAGDTGV
HGSGAYGSGA YGSGAYGSGA NDTGAHDSRG YGQGAYGQGR RDPGGRDQGG YDREAYASRG
RAGTPPAVGA PGDQGNYWRT GADPAERPRD QGTDAFTAPP GEASYRREEY P
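
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any calculation. As a rough sketch of how a figure like the GC content in the record could be recomputed, the helpers below strip the whitespace and report the percentage of G and C bases. The names `clean_sequence` and `gc_percent` are hypothetical, and the database may round or count ambiguous bases differently.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence, copied from the record above.
first_line = "ATGACCTCCG CCTCACGATC CACATCCCTG CCGGGGATTA CCCCGTTCGC GCTGGCCGGC"
print(len(clean_sequence(first_line)))
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence instead of a single line gives the value to compare against the GC content field.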