Gene Franean1_3043 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_3043 
Symbol 
ID5671422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp3576191 
End bp3578062 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content67% 
IMG OID641241941 
Productsiderophore-interacting protein 
Protein accessionYP_001507361 
Protein GI158314853 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG2375] Siderophore-interacting protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones15 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCGAAGA CCTCACGTGG GCTCACAGTG CACCCGCTGA CCCTGCGAGA GGTCGAGGTG 
GTTCGGGTGG TGGACCTGAC GCCGGGGATG CGACGGGTCA CGTTGGGTGG CGAGCAACTT
CGTGCGTTCA CCTCGGCGAA CGGCTTCCCG CAGCCGGCGT TCGACTCGAC AGGCTTCGAC
GATGACATTC GGTTGGTGTT CTGCTACCCC GATCAGGCCG AGCCGGTGCT GCCAGTGCAG
AAGGAGAAGG GCCTCGAGCT GACGAGGAAC CCCCGGCCGT TGTCAAGGGG CTACACAGTT
CGTCGCTGGA ACCCCGCGAC CGGAGAACTG GACGTGGACT TCGTCAAGCA CGGTATCGGT
GTCGGCAGCA CCTGGGCGTA CCGTACCCAA CCCGGCGACC GCATCCACTT CCACGGCCCG
AGCGCATCGC GGGCGCTACC GCAAGACGCG GACTGGCTTC TTGTCGCCGG CGACGACACA
GCGATCCCCG CGATCGCCCG CCTGCTTGAT GAGTTGCCCG ATGACGCGCG GGCGCAGGTG
TTCGTCGAGG TCGCCGAAGA CGCTCATCGG CTGGAGCTGA GGGCTCTGCC GCGGGTTGAG
GTGACCTGGT TGGTGCGCGA CGGCGCCGAG GCGGGCACGA CCACGCTTCT GGTGGACGCG
GTCAAGAACA GTGGCTGGTG GGACGGCCGA CCGTTCGCGT GGTTCGCGGG AGAGCGCTCG
GCGGTGCGGG ATCTGCGTCG CCATCTGGTC GAGGACCGGA GTGTGCCCAA GGAGGACATC
GAGTTCACCG GGTACTGGCG TCGTGGCGAG GTCATTGCCC TAGAGGCCGA CGGAGCGGTG
CCCGACCCCG AGAAGTCGAT CACGCCCTTC GAGAAGCTCC ATCAACTGAC CGAACTGGTC
CCGCCGATCG CGATCCGCAC CGCCGTCGAG CTGGGCATTC CCGAGCTGAT CTCGCGTGGC
GTCAGCAGCG TGGCGGACTT GGCCATCAGG GCGGATGCCG ACAAGCGGGC GTTGGGCAAG
CTGCTGCGCT ACCTGCATGC TCTGGACGTG CTGACCGAGA CCGAACCGGG CCGCTACGCT
CTGACGCCGG TGGGTGAAGT CCTAACCGTC GAGTTCGTCG CGGACTCCTT GCACCCGTCT
GGGGTGGGAG GCCGGGAGAT GCTCGGCGTC TACGGACTCA CCGAGTCGAT CCGCACTGGC
CGGTCGTCTT ACGCCTCCGT CACGGGTCAG ACTTTTGCTG AGGTGCGGGC CGAGCAGGAC
TACGAGGACC GCTACCTGGA ACGTCTGGCC GAGTTCCAGC ACGCGCTGGC TATGTCGATC
GCCAAATCCA ACCTGCTTAC TGGTGTCCGG CACCTGGTGA TCCACTCCGG TGGCGCGGGC
GTGCAGGCCC GCGAGTTCGT CGCCGCTCAT CCTGATCTGC GGGTCACGAT CTGCGTGCTG
CCCGCCCAGG CCGACTGGCT ACGCCGCGAC TTGCCCGTCA CGATCCCCGA CGAGCAGCAG
CGGGCACGGG TCAGCGTGGT CGAGCAGTCG GTATTCGAGC CCAGCCCCGA ATCGGACGTC
GTGTTCATCA GCCGCGCTTT CAAGGCGCTA CCCGACGCCG ACGCCGCCCA CGCTCTCCGC
CGAGCGTCCG AGAACCTCCT CCCGGGTGGG CGGGTGCTGC TGGTCGAGGA GGTCTTCGAC
ACCGACGACC TCGATGAACA TGACGGCGAA GCCGACCTGA TCGGACTCGC GGTACATGGC
TCCGGTCTAC GCACCGCCGA GGAACTGGAC ACCGTGATCA CCCGGTCAGG GCTCACCCGC
ACAGAGACGC ACATCGTCGG CTGGGGCACC ACCGTCCACG AACTCGTCCG CAACAACACC
AACTGCCCCT GA
 
Protein sequence
MPKTSRGLTV HPLTLREVEV VRVVDLTPGM RRVTLGGEQL RAFTSANGFP QPAFDSTGFD 
DDIRLVFCYP DQAEPVLPVQ KEKGLELTRN PRPLSRGYTV RRWNPATGEL DVDFVKHGIG
VGSTWAYRTQ PGDRIHFHGP SASRALPQDA DWLLVAGDDT AIPAIARLLD ELPDDARAQV
FVEVAEDAHR LELRALPRVE VTWLVRDGAE AGTTTLLVDA VKNSGWWDGR PFAWFAGERS
AVRDLRRHLV EDRSVPKEDI EFTGYWRRGE VIALEADGAV PDPEKSITPF EKLHQLTELV
PPIAIRTAVE LGIPELISRG VSSVADLAIR ADADKRALGK LLRYLHALDV LTETEPGRYA
LTPVGEVLTV EFVADSLHPS GVGGREMLGV YGLTESIRTG RSSYASVTGQ TFAEVRAEQD
YEDRYLERLA EFQHALAMSI AKSNLLTGVR HLVIHSGGAG VQAREFVAAH PDLRVTICVL
PAQADWLRRD LPVTIPDEQQ RARVSVVEQS VFEPSPESDV VFISRAFKAL PDADAAHALR
RASENLLPGG RVLLVEEVFD TDDLDEHDGE ADLIGLAVHG SGLRTAEELD TVITRSGLTR
TETHIVGWGT TVHELVRNNT NCP