Gene Franean1_1875 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1875 
Symbol 
ID5670277 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2253088 
End bp2254782 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content69% 
IMG OID641240797 
ProductTIR protein 
Protein accessionYP_001506219 
Protein GI158313711 
COG category[R] General function prediction only 
COG ID[COG2358] TRAP-type uncharacterized transport system, periplasmic component 
TIGRFAM ID[TIGR02122] TRAP transporter solute receptor, TAXI family 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.399913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCATCGG GGGACTGCGA CAACGGGGGC GCGGTGGCGA ATGTCGCCGG GACGGACTTC 
TTCGTCTCGT ACGCCACCGC CGACGAGGCG TGGGCGGAGT GGATCGCCTG GAACCTCGAG
GAGGCCGGGT ACAGCACCCG CATCCAGGTC TGGGATTTCG CTGTCGGATC TCATTTCGTT
CATGAAATGC ACCGGGCGGC AGGAGGCGCG GCCAGGACGG TTGCCGTGCT CTCCGCCGAG
TATCTCGCTT CGGCCTACGC CGAGGCCGAA TGGGCGGCGG CCTGGGCGCG TGACCCGATC
GGCATCCATC GCCGGCTGCT TGTTCTCCGG GTCGGCGAGT GCGAACAACC CGGCCTGCTG
CGACAGGTCG TCGGGACGGA CCTGTTCAAC GTCGACGAGA ACACCGCCCG GGAGCGCGTG
CTGGCGGCGG CCCGCGGACA GCGGCACAAG CCGAAGTCAC CACCGACCTT CCCCGGCAGG
CCCGCCACGG CGACGGCGTC CGCGAGTACG CCCACACCGG CACGACGGCC GCCTCTGGCA
CGGCCGAAGT TTCCCGGCGG CCGGGTACGG CGCGGACCGT GGCCGGTCCC ACGGGTGCTG
ATCGCGCTGG TGCTGCTCGT GGCTCTCACC GGCGGTGGAA TTGTGGGCAC CGGCGCCCTC
AAGAACTGGC TGGGCGGTTC CGGCCCGTCA CCGTATGCCG ACACCCGCTG TACCGGCGTG
CAGGTACTGG GCGGCCTGGC CGGTTCGCCC TACAGCCTGT ACGCCCAGAC GCTCGCCGAC
CTTATCAACG ATAAGGTGCT CAACGGCGGC AACGACTGGT CGGCCACCGC TGATCTGAGC
TTGTCGGGGA CATCCGCCAG CCTGCGCACC CTCGCCGCGG ACCCGAAGTG CACCCTCAGC
CTCGCCCAGC TCAACGTCCC GGTGGACGCG GAACGCGGGC TGGGGGATTT CCGTCCGGAA
AACGGCGGTC CCATCGCCGG TCTCCGCTAC GTCGGACAGA TCTACTACGA CATGGTGCAC
CTCATCGTGC CGCCCGACTC GCCCATCAGT GGACTCGTCG ACCTGTGTGG GAAGCGGGTT
CTCACCGGAC GGGACAGATC GGGCACCAAC CAGATCAGCA CTGTGCTCAT CCGGTACGTG
CAGGACACCT GGAACTGCAA GGTGGACAGC AGCCTCGACG AGGGGCTCAA GGCGGCGCTG
CCACGACTGG ATCCATCCTC GACGGCCCGG GTGGACGCCG TCTTCTGGGC GGCCGGTGCC
CCGACGAAGC TGATCAGCGA CTACCTCGCA AAGGGGCACA TGATCAGGCT CGTCCCGCTG
AACGACGGTC GCGTCGCCGT CGCCAAGGAA TGGTCCACCG TCTACCCGGA CGTGAGCCAG
CCATTCGTGG CACTGCCGCT CGGCCCGGAA TATCCGGGGG TGGCCAAGGT CGGCACCTTC
GGAACGCCGA ACGGCGTGGT CGCGATGAAA TCGACGGACC GGCCGCTCGT CGCCTTTGTC
GCGCGGATGC TCCAGGAGCG GAGGAAGGAC TTCGAGCATG ACCTGTGGCC GGCGCACCCA
CCGGGTTGGT TTCCCACCCC GTGGTCGTTC GCCGATTCCG GTCTGTGTCA GGCGGTCCCG
CTGCACGACG CGGCCTACAA GGCGTACCTC CAGGCCCCGG ACGGGCAGGC GCCCTCCGGA
TGCCGGGTCG GCTGA
 
Protein sequence
MASGDCDNGG AVANVAGTDF FVSYATADEA WAEWIAWNLE EAGYSTRIQV WDFAVGSHFV 
HEMHRAAGGA ARTVAVLSAE YLASAYAEAE WAAAWARDPI GIHRRLLVLR VGECEQPGLL
RQVVGTDLFN VDENTARERV LAAARGQRHK PKSPPTFPGR PATATASAST PTPARRPPLA
RPKFPGGRVR RGPWPVPRVL IALVLLVALT GGGIVGTGAL KNWLGGSGPS PYADTRCTGV
QVLGGLAGSP YSLYAQTLAD LINDKVLNGG NDWSATADLS LSGTSASLRT LAADPKCTLS
LAQLNVPVDA ERGLGDFRPE NGGPIAGLRY VGQIYYDMVH LIVPPDSPIS GLVDLCGKRV
LTGRDRSGTN QISTVLIRYV QDTWNCKVDS SLDEGLKAAL PRLDPSSTAR VDAVFWAAGA
PTKLISDYLA KGHMIRLVPL NDGRVAVAKE WSTVYPDVSQ PFVALPLGPE YPGVAKVGTF
GTPNGVVAMK STDRPLVAFV ARMLQERRKD FEHDLWPAHP PGWFPTPWSF ADSGLCQAVP
LHDAAYKAYL QAPDGQAPSG CRVG