Gene Franean1_3872 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3872 
Symbol 
ID5672235 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4602129 
End bp4603055 
Gene Length927 bp 
Protein Length308 aa 
Translation table11 
GC content72% 
IMG OID641242750 
Productbinding-protein-dependent transport systems inner membrane component 
Protein accessionYP_001508170 
Protein GI158315662 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0600] ABC-type nitrate/sulfonate/bicarbonate transport system, permease component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0743241 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0892137 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACCGACC TGCTGCCCTC GGGCGCGTCC CGCGCCGGGG CGCCCGACCT GGCCGGCGGG 
GGGCCTGGTC TGGCCGGGGC GGCCGCCGCT CCGGGCGCCG GGGCGGACCT GCCGCCGGCG
CCCGGCCTCG TCCGCCCGCG GGCGTCACGG CGGCACACGC ACGGGCGCGG GGCGTCACTG
GCGCTGCGCG CTCTGCTGCC CCTCATCCTG TTCGGCCTGT GGTGGTGGGG CACCGAGGCG
GGCTGGATCT CGTCCGACGT TCTCGTCTCA CCCCCGCGGA TGGTCGAGAC CTTCGGTGAC
CTGGTGCGCG AGGACGAGTT GTTCCACCAG GTGTCGGTGT CGCTCGACCT GGCCCTGCGC
GGGGCGCTGT TCGGGGCAGC GGCCGGTTTG CTGTTCGGTG TCGTCGCCGG CCTGTGGCGG
ATCGGCGAGG AGCTGCTCGA CGCGGTGCTG CAGATGCTGC GGACCATTCC CTTCCTCGCG
GTCGTGCCGC TGTTCATCGT CTGGCTCGGC ATCGGCGACC TGCCGAAGGT GTTGCTGATC
TCGCTCGCCA CGCTGTTCCC GATGTACCTG AACACCTACA ACGGTGTCCG CAACGTGGAC
CGCCGGGTCA TCGAGGCCAT GGAGGTGTTC GGCCTGCGCG GGGCCCGGCT CGTGCTCACA
GTGATCATTC CGCTGGCGCT GCCGTCGATC CTCACCGGGC TGCGGTACTG CCTCGGGATC
TCCGTCCTCG CGCTCATCGC CGCCGAGCAG ATCAACTCCA GCGCCGGCCT CGGCTACCTC
ATGTACCAGG CGCAGTCGAT GCAGCAGGTC GACGTCCTGG TGGTGGTGCT GGCCATCTAC
GCCGTGCTCG GGCTCCTGTC GGACCTGGTG GTCCGGGTGC TCGAACGCCT GCTGATGCCG
TGGCACCGCG GCCTGGCCGT CCGATGA
 
Protein sequence
MTDLLPSGAS RAGAPDLAGG GPGLAGAAAA PGAGADLPPA PGLVRPRASR RHTHGRGASL 
ALRALLPLIL FGLWWWGTEA GWISSDVLVS PPRMVETFGD LVREDELFHQ VSVSLDLALR
GALFGAAAGL LFGVVAGLWR IGEELLDAVL QMLRTIPFLA VVPLFIVWLG IGDLPKVLLI
SLATLFPMYL NTYNGVRNVD RRVIEAMEVF GLRGARLVLT VIIPLALPSI LTGLRYCLGI
SVLALIAAEQ INSSAGLGYL MYQAQSMQQV DVLVVVLAIY AVLGLLSDLV VRVLERLLMP
WHRGLAVR