Gene Franean1_0519 details


Gene Information

Locus tag: Franean1_0519
Symbol:
ID: 5668938
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 604272
End bp: 605528
Gene Length: 1257 bp
Protein Length: 418 aa
Translation table: 11
GC content: 60%
IMG OID: 641239448
Product: hypothetical protein
Protein accession: YP_001504886
Protein GI: 158312378
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
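
The start/end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, although it matches the stated 1257 bp) and that the CDS ends in a single stop codon. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields above.
START_BP = 604272
END_BP = 605528


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1257 418
```

Both results agree with the Gene Length and Protein Length fields of this record.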


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGATTTTCG GTAGATTGTC GATACGATGG ATAGTGTCGG TTGCAATTGT CGTTGTTGTG 
GCTGGCTGTA CTTTGCCGGG CAGTGGGAAA GAGCGACCAG TGGCGGTTGG CTGTGACAGC
CCGGGAGTCA CCTCCGACCA GGTAAAACTG GGGTTGGTGT TCTCTGACTC GGGTATTGGT
AGCACTGCAC TTTCCTCGGC CCGGTCCGGG GTGGATGCTC GGATCAACCT GGCCAACGCT
CAAGGTGGCA TCCACGGTCG TAGGATTTTT TACCAGTGGC GCGATGACGC GAGTTCCTCG
TCACAGAACG CGCTGGCCAC TCAGGATCTT GTGCAACAAG AATCCGTGTT CGGTCTCGTG
GCAGCTACCG CCTCGCTCGA AGGTTCACTG GCTCGTCTCG ACGCGCAGGG CATCCCCGTG
GTAGGTATCG CGCTTCCGTC TTGGAACAGG TATCGTAACC TTTTTTCCCA CCTCTACATG
CCCTCTCCGG GAACGGTCGC CCGCTATATC CAGGCCCACG GTGGGACGAG AATCGCTGTT
GTTACCACCG GCACTGTAGC CTTAACCATG GAGACCATCA CTCAGTACAA AAATGCCTTC
AGCGCTCTCG GACTCGCCGC GACCGATCCC ATCCCGTACA CGAGCAGTAG CGACAGCCCG
CAGCGCATCG TCCACCAGCT CGCGGCTATC CACGCCGATG CCCTGATCGG CTTCACCGCG
CCAGAGGATC TCGCTGACAT CGCGCAGGCA GCTCGTGCCG CGAACCTACG CCTGAACGTC
GACGTTTCCC TGACCGGCTA TGACAAAGCC CTTCTTCCCG CGTTCGGCCA AGCGCTAGCT
GGTGTGTCTA TCCCCGTGTA TTTTCGTCCG TTCGAGGCGG GAGGCCCAGC CATCGACCGC
TACCGCGATG CAATGACACT CTACGCACCT GAAAGCATCG AACCCGACCA GCAGTTCGCC
ATGCTCGCCT ACATATACAC CGACCTGTTC CTACACGGAC TTGACCTAGC CGGCACCTGC
CCAACCCGTG AAGGGTTCAT CAAGGCCCTA CGGGGTGTCA CCGATTATGA TGCGGGCGGT
CTGATCTCGC CCGTCGACCT GAGTGCCAAC TCTACCCGCC CCCTCGATTG TTTCGCGTTC
GTCCGTGTCA ACTCCACCGG CACTGCGTTC GACGTCGTAC ATCAACGACT CTGCTCCGAC
GGCTCGGAAT CCCTGCCGCC GGGAAATGAA TCCACGCCAA CCGGCCGTAG TCGGTAA
 
Protein sequence
MIFGRLSIRW IVSVAIVVVV AGCTLPGSGK ERPVAVGCDS PGVTSDQVKL GLVFSDSGIG 
STALSSARSG VDARINLANA QGGIHGRRIF YQWRDDASSS SQNALATQDL VQQESVFGLV
AATASLEGSL ARLDAQGIPV VGIALPSWNR YRNLFSHLYM PSPGTVARYI QAHGGTRIAV
VTTGTVALTM ETITQYKNAF SALGLAATDP IPYTSSSDSP QRIVHQLAAI HADALIGFTA
PEDLADIAQA ARAANLRLNV DVSLTGYDKA LLPAFGQALA GVSIPVYFRP FEAGGPAIDR
YRDAMTLYAP ESIEPDQQFA MLAYIYTDLF LHGLDLAGTC PTREGFIKAL RGVTDYDAGG
LISPVDLSAN STRPLDCFAF VRVNSTGTAF DVVHQRLCSD GSESLPPGNE STPTGRSR
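
The sequences above are displayed in space-separated blocks of 10 characters, 60 per line, so they need to be joined before any analysis. The sketch below shows one way to strip that layout and compute a GC percentage comparable to the GC content field of this record. The helper names are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_percent(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First line of the gene sequence as displayed on this page.
first_line = "ATGATTTTCG GTAGATTGTC GATACGATGG ATAGTGTCGG TTGCAATTGT CGTTGTTGTG"

joined = clean_sequence(first_line)
print(len(joined))              # 60
print(joined.startswith("ATG"))  # True: the gene begins with a start codon
print(round(gc_percent(first_line), 1))
```

Passing the full gene sequence (all lines concatenated) to `gc_percent` gives the value for the whole gene rather than for this 60 bp sample.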