Gene Franean1_0462 details

Gene Information

Locus tag: Franean1_0462
Symbol:
ID: 5668883
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 546324
End bp: 547541
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 62%
IMG OID: 641239393
Product: hypothetical protein
Protein accession: YP_001504831
Protein GI: 158312323
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0683] ABC-type branched-chain amino acid transport systems, periplasmic component
TIGRFAM ID:
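
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive (the record does not state this, but the assumption reproduces the listed gene length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 546324
end_bp = 547541

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded from the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length)     # 1218
print(protein_length)  # 405
```

Both results agree with the Gene Length (1218 bp) and Protein Length (405 aa) fields. The calculation does not depend on the strand, which the record leaves blank.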


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCCTTAG ATGAATTGAC AAGATGGCTT GGAGCGGCGG CGGTTGCTGC GCTTACGGTT 
GCGGTGACTG GGTGCTCGGT AGTGGGAGGT GGAGGGGATG CGCCAGTGGC CGCCTGCGAA
AGCCCCGGGG TCACGGCCGA CAAGGTGCAC GTCGGTTTCG TCTTCTCCGA CTCGGGTAGC
GGGAGCTCCG CGCTCTCCTC CGCCCGTGCC GGGGTCGATG CCAGGATCGG CCTGGCCAAC
GATGCCGGGG GGGTGAACGG CCGCCACATC GTTTACGACT GGCGCGACGA TGCCGGGTCG
CCGTCGCAGA ATGCCCGCGT CACCGAGGAA CTGGTCCACG ACGAGTCCGT CTTTGGGCTC
ATATCGGCCA CAGCCGCCGG CAGCGGCTCG CTGGACAGCC TCTCGGCCAT GGGGATTCCG
GTAACCGGCC TCGCGAACCC GACTTGGGCG AAATATCCAA ATCTGTTTGC ATATATGTAC
GATGTCTCCC CCGTAGTCAC CGCTCGTTAC ATCCAGGCAA ACGGCGGAAC GAAAGCGGCT
TTCGTAATGA CCGGGTCGCC GGCTTTCACC GTACAGACCA TCGAGCGGTA CAAGACGGCC
TTCGCCGCTA TCGGTGTCGG CACAACCGAA ACCATCTCCT ACGCGAGCGG CGTCGACAGT
CCGACCCAGG CGGTAGAACG CATGGTGAAC AGTGGTGCTA ACGCCATCGT CGCCTTCACA
ACTCCCAAGG ACCTCGTCGA GATACTGCAG ACCGCGAGTG CCGCTAACCT GCGTTTCTCC
AGCACTGTCT CGGTCACCGG ATACGACCGC GGCGTTCTTA AAACTTACGG ACCGCAGCTC
GCCGGCGTCT CGTTCACCGT GAACTTCCAC CCCTTCGAGA TGCGGAATGC CTCCATGGAT
CGGTATCGCG ACGCCATGAC GCGATTCGCT CCGGAGAGCA GCGTACCCGA ACAACAGTTC
GCACTCTACG GCTATCTGTA CACCGACCTG TTCATCCGCG GCCTCGAGCT GGCCGGTGAA
TGCCCCACGC GCGAGGGATT CATCAAATCT CTGCGAAAAG TAACTGACTA CGATGCAGGC
GGCCTGATAG AACCAGTCGA TCTCAGCACC AATAGAACTC AGCCGCTCCA ATGTAACGCA
TTCGTTCGGA TCAACCCCGA CGGCACCGCC TTCGACATCG CTGGTGCACG GCTGTGCGCT
GACGGTACAG GCGCCTGA
 
Protein sequence
MPLDELTRWL GAAAVAALTV AVTGCSVVGG GGDAPVAACE SPGVTADKVH VGFVFSDSGS 
GSSALSSARA GVDARIGLAN DAGGVNGRHI VYDWRDDAGS PSQNARVTEE LVHDESVFGL
ISATAAGSGS LDSLSAMGIP VTGLANPTWA KYPNLFAYMY DVSPVVTARY IQANGGTKAA
FVMTGSPAFT VQTIERYKTA FAAIGVGTTE TISYASGVDS PTQAVERMVN SGANAIVAFT
TPKDLVEILQ TASAANLRFS STVSVTGYDR GVLKTYGPQL AGVSFTVNFH PFEMRNASMD
RYRDAMTRFA PESSVPEQQF ALYGYLYTDL FIRGLELAGE CPTREGFIKS LRKVTDYDAG
GLIEPVDLST NRTQPLQCNA FVRINPDGTA FDIAGARLCA DGTGA
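
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a few lines with translation table 11, the table named in the record. This is a minimal sketch, not the method the database used: the `translate` helper and the codon-table construction are illustrative, and only the first and last lines of the gene sequence are translated. The last line starts in frame because the 20 preceding lines hold 1200 bases, a multiple of 3.

```python
# Amino-acid string for NCBI translation table 11, with codons ordered
# by base T, C, A, G at each position ("*" marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, ignoring whitespace.

    Alternative start codons are not treated specially; this gene
    starts with ATG, so that does not matter here.
    """
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGCCCTTAG ATGAATTGAC AAGATGGCTT GGAGCGGCGG CGGTTGCTGC GCTTACGGTT"
last_line = "GACGGTACAG GCGCCTGA"

print(translate(first_line))  # MPLDELTRWLGAAAVAALTV
print(translate(last_line))   # DGTGA*
```

The first result matches the opening 20 residues of the protein sequence, and the second matches its final five residues followed by the stop codon.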