Gene Franean1_0449 details

Gene Information

Locus tag: Franean1_0449
Symbol:
ID: 5668871
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 529072
End bp: 530640
Gene Length: 1569 bp
Protein Length: 522 aa
Translation table: 11
GC content: 64%
IMG OID: 641239381
Product: extracellular solute-binding protein
Protein accession: YP_001504819
Protein GI: 158312311
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0747] ABC-type dipeptide transport system, periplasmic component
TIGRFAM ID:
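
The start coordinate, end coordinate, gene length, and protein length in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes 1-based inclusive coordinates and a single terminal stop codon, neither of which the record states explicitly.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for a CDS, assuming one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return length_bp // 3 - 1


length = gene_length_bp(529072, 530640)
print(length, protein_length_aa(length))  # 1569 522
```

Under those assumptions the values agree with the Gene Length and Protein Length fields.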


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATAGACG GCGCATTATC CCGCAGGCAG GTACTCGGCG CCGCACTCGC TACGGTGGCA 
CTCGCCGGCG CGAGTTCGTG CGCCGGCGAG GAACAATCGT CCGGCGACGG TGGCAGCGGA
CGGCCGTCCG CGTCGCGCGA ACAGACATTG TCCCTTGCCA TCCAGGCCAC TCCGAATTCC
TTCGATCCCG CTGAACTGAC AAGCGGCCAG TCCTCGTTCG TGTGGAGCGC CCTCTACGAC
ACCCTGATAT GGAAGGACAA TAAGGGCAAG TTACAGCCCA ACGCGGCAGA GAGCTGGAAC
TACTCCGACG GCGGGCGCAC ACTGACATTG AAGCTGCGCA AGGGAATGTC CTTCAGCTCG
GGTGCCCCGG TGAACGCGGT CGCGGTGAAG ACCACCCTCG AGCGGAGCAA GAACACGCCA
GGATTCACCG ACCAAGTCCT CGGCGCGCTC GAATCGGTCG ACGCGCCCGA CGACCGCACC
GTCGTCCTCC GACTCTCCCA TCCGGACGGC GCACTGCTGG ACTCGCTGGC AGTGAGCGGC
GCGGGCGTGA TCGGCGATCC AGCGACGCTG AACGACAAAC GTACCGCGCT GAACCCGGTC
GGTTCGGGCC CATACGTCCT GAACACCGGA CAGACGGTGA ACGGATCGAC CTACGTGCTC
GATCGCCGCG AGGACTACTG GAACGTTCAG GCGTACCCGT TCAAGACCGT CAAAATCTCG
GTCATCCGGG ACCGAACCGC TGCCCTCAAC GCCCTGCAGG CGGGTGAGGT CAACGCCGGT
ACCGTCGAGG TGACAAACGT GGACCGGCTG CGGGCGGCCG GCTTTGACGC CGCCGTCGTC
GAGGCCCACT CGCTGGCCTC GCTGGTTCTC GCCGACCGTA CAGGGGAGTC GCTCAAACCG
CTGGGCGATC CACGGGTCCG GCAAGCCATC AACATGGCGT TCGACCGCGA GAAGATCGTC
GAACAGCTGC TCAGGGGCTC GGGTAAGCCG ACCGAGCAGG TGTTCAACCC CAAGGACCCG
GCGTATGACC CGGCACTGAA CACGACGTAC GCCTACGATC CACAGCGCGC GAAGAGACTG
CTGGCCGAGG CCGGATATCC CAACGGATTC TCGGTAACGA TGCCGGAATT TTTCTTAGCC
AAGTCGTTCG CACCGACAAT CACCCAGTCC CTGGCCGCTA TCGGAATCAC GACGACGTGG
GAACCGGTTC CCCCACAGCA GACCGACGCG GCGATCAGCT CGAAGAAATA TCCGGCGTTC
TTCCTAATTG CCGGGCTGGA GACGACTGCG GGTGACGCGT CCAGATATTT CTCCAAGGAC
GGAGCGTTCA ACCCCTTCCA CGCGGAGGAT CCGGACCTCA CGCCACAGGT GGAGCAGGCG
ACTCAGACAA TTGATCCGCG GCAGGCAGCC GATGCCTACA GGCATGTCAA CGCCACCGCG
GTCCGGGATG CGTGGAACGC CCCCCTCTTC TACGTCGCGG TCCACTGGGT AACCAAAAAA
GGCATCACCT ATCTCGGTGA CGGCTCGCTG ACGTTCAACA CCGTTCGCGC CTTCGGCCTG
TCCGGATAA
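
A GC content figure such as the one in the Gene Information section can be recomputed from the gene sequence. The following is a minimal sketch, not the database's own method: `gc_fraction` is a hypothetical helper that ignores the spaces and line breaks used to group the sequence into blocks of ten and counts only G and C against the total number of bases.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases; whitespace in the input is ignored."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# First ten bases of the gene sequence: 4 of 10 are G or C.
print(gc_fraction("ATGATAGACG"))  # 0.4
```

Passing the full gene sequence as a single string gives the value for the whole gene, which can then be compared with the GC content field.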
 
Protein sequence
MIDGALSRRQ VLGAALATVA LAGASSCAGE EQSSGDGGSG RPSASREQTL SLAIQATPNS 
FDPAELTSGQ SSFVWSALYD TLIWKDNKGK LQPNAAESWN YSDGGRTLTL KLRKGMSFSS
GAPVNAVAVK TTLERSKNTP GFTDQVLGAL ESVDAPDDRT VVLRLSHPDG ALLDSLAVSG
AGVIGDPATL NDKRTALNPV GSGPYVLNTG QTVNGSTYVL DRREDYWNVQ AYPFKTVKIS
VIRDRTAALN ALQAGEVNAG TVEVTNVDRL RAAGFDAAVV EAHSLASLVL ADRTGESLKP
LGDPRVRQAI NMAFDREKIV EQLLRGSGKP TEQVFNPKDP AYDPALNTTY AYDPQRAKRL
LAEAGYPNGF SVTMPEFFLA KSFAPTITQS LAAIGITTTW EPVPPQQTDA AISSKKYPAF
FLIAGLETTA GDASRYFSKD GAFNPFHAED PDLTPQVEQA TQTIDPRQAA DAYRHVNATA
VRDAWNAPLF YVAVHWVTKK GITYLGDGSL TFNTVRAFGL SG
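
The protein sequence can be checked against the gene sequence by translating it codon by codon. The sketch below is an illustration under stated assumptions: `translate` is a hypothetical helper, the gene sequence is taken to be the coding strand, and the codon table is built from the standard genetic code, whose codon-to-residue assignments translation table 11 shares (table 11 differs in its permitted start codons).

```python
from itertools import product

# Standard codon assignments in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): residue
    for codon, residue in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()
    residues = []
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(bases) - 2, 3):
        residue = CODON_TABLE.get(bases[i:i + 3], "X")  # 'X' for unknown codons
        if residue == "*":
            break
        residues.append(residue)
    return "".join(residues)


# First 30 bases and last 9 bases of the gene sequence above.
print(translate("ATGATAGACG GCGCATTATC CCGCAGGCAG"))  # MIDGALSRRQ
print(translate("TCCGGATAA"))  # SG
```

The two fragments reproduce the first ten and last two residues of the protein sequence; the same function can be applied to the full gene sequence for a complete comparison.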